Gene NATL1_21901 details

Gene Information

Locus tag: NATL1_21901
Symbol: argG
ID: 4779237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1850368
End bp: 1851570
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 38%
IMG OID: 640085488
Product: argininosuccinate synthase
Protein accession: YP_001016010
Protein GI: 124026895
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
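
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1850368
end_bp = 1851570

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 1203 bp and protein length of 400 aa.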


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.898377
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAGG CTAAAAAAGT TGTTTTGGCT TATTCAGGAG GGGTTGATAC TAGCGTTTGT 
ATTCCTTACT TGAAGAAGGA ATATGGAGTC GAGCATGTAA TTGCATTTGC AGCAGATCTT
GGTCAAGGCG ACGAGCTTGA TCAAATCAAA CAAAAGGCTA TTTCAGCAGG AGCTTCAGAA
TCTTTAATTG GCAATTTGGT AAAGCCTTTC ATAGAAGATT TTGCTTTCCC AGCGATTAGA
TCTAATGCTT TGTATCAAGG TAGATATCCT CTTTCAACGG CATTAGCTAG ACCATTAATT
GCAAAAAACC TTGTTGAAAT TGCAAGGGAA CTAAATGCCG ATGGAGTCGC TCACGGGTGT
ACGGGCAAAG GGAACGATCA AGTGCGTTTT GATGTGACTA TTGGAGCTTT AGCTCCTGAT
TTGCAATTGC TTACACCAGC ACGGGAATGG GGTATGAGCC GCGAAGAAAC CATTGCCTAT
GGAGAAAAAT ATGGAATAGT TCCCCCTGTA AGTAAAAAAA CTCCTTACTC GATTGATTTG
AATCTTTTGG GGAGAAGTAT TGAAGCTGGT CCTCTTGAAG ATCCATTTGA GATGCCATCA
GAAGAAGTGT TTGGCATCAC TTCTTCTATA GCTGATTCAC CAAACGAGCC TGAGATAGTA
GATATTTTGT TTGAAAATGG TTATCCAGTT GCAATTAATG GAGAAGCGAT GGAGCCAGTA
TCCCTGATTA AAAAAGCTAA TAGCCTTGCA GGAAAGCATG GCTTTGGACG TTTGGATATT
ATTGAAGACA GAGTAGTAGG AATTAAAAGT CGAGAAATTT ATGAAACTCC AGGATTGCTT
TTATTAATTA AAGCTCATCA GGAAATTGAG AGTTTAACTT TACCTGCCGA CTTATTAGAT
ACTAAATTTA GACTCGAACG ACAATGGGCA GACTTGGTTT ATAAAGGTTT TTGGTTTAGT
CCTCTAAAAG AAGCTTTGGA TGGATTTATT AATTATTCTC AAAAGCAAGT GAATGGAACA
GTCAGGGTTA GGCTTTTTAA GGGTAATGTC GATGTTATAG GTCGCAAGTC AAAAGAAAAT
AGTTTGTATA TTTCAGATAT GTCTACTTAT GGAAGTGAGG ATAAGTTCAA TCACAAATCC
GCTGAAGGAT TTATATATGT ATGGGGATTG CCTAGTCGAA TTTGGTCTTG GATAAACAAG
TAA
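
The GC content field of the record can in principle be recomputed from the gene sequence above. The sketch below shows one way to do this; the function name `gc_content` is hypothetical, and the sequence is assumed to be pasted in as text with the spaces and line breaks of the listing still present.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (for example the spaces that
    separate the 10-base blocks in the listing) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Example on the first two blocks of the gene sequence only;
# pass the full sequence to compare against the "GC content" field.
first_blocks = "ATGGGAAAGG CTAAAAAAGT"
print(f"{gc_content(first_blocks):.2f}")
```

Multiplying the result for the full sequence by 100 and rounding gives a percentage that can be compared with the value listed in the record.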
 
Protein sequence
MGKAKKVVLA YSGGVDTSVC IPYLKKEYGV EHVIAFAADL GQGDELDQIK QKAISAGASE 
SLIGNLVKPF IEDFAFPAIR SNALYQGRYP LSTALARPLI AKNLVEIARE LNADGVAHGC
TGKGNDQVRF DVTIGALAPD LQLLTPAREW GMSREETIAY GEKYGIVPPV SKKTPYSIDL
NLLGRSIEAG PLEDPFEMPS EEVFGITSSI ADSPNEPEIV DILFENGYPV AINGEAMEPV
SLIKKANSLA GKHGFGRLDI IEDRVVGIKS REIYETPGLL LLIKAHQEIE SLTLPADLLD
TKFRLERQWA DLVYKGFWFS PLKEALDGFI NYSQKQVNGT VRVRLFKGNV DVIGRKSKEN
SLYISDMSTY GSEDKFNHKS AEGFIYVWGL PSRIWSWINK
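
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons. This is a minimal sketch, not a full implementation of translation table 11: the `PARTIAL_CODONS` dictionary is a hypothetical helper that contains only the codons occurring in the first 30 bases of the gene, and `translate_prefix` stops at the first codon it does not know.

```python
# Partial codon table (assumption: only the codons needed for this example).
PARTIAL_CODONS = {
    "ATG": "M", "GGA": "G", "AAG": "K", "GCT": "A",
    "AAA": "K", "GTT": "V", "TTG": "L",
}


def translate_prefix(seq: str, table: dict) -> str:
    """Translate codons from the start of seq until an unknown codon is met."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if codon not in table:
            break
        residues.append(table[codon])
    return "".join(residues)


# First three 10-base blocks of the gene sequence (10 codons).
gene_start = "ATGGGAAAGG CTAAAAAAGT TGTTTTGGCT"
prefix = translate_prefix(gene_start, PARTIAL_CODONS)
print(prefix)  # compare with the first block of the protein sequence
```

For a complete translation, a full codon table for translation table 11 would be needed in place of the partial dictionary.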