Gene NATL1_11571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_11571 
Symbolgap3 
ID4780794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1033950 
End bp1034975 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content38% 
IMG OID640084436 
Productputative glyceraldehyde 3-phosphate dehydrogenase 
Protein accessionYP_001014980 
Protein GI124025864 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0636891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTG GTATTAATGG TTTTGGCCGT ATAGGACGAC TTGTTCTTAG AGCACTCTGG 
GATAGGGAAA ATATCGAGAT TTCGCATATC AACGATCCAT TTGGTGATGC AAAGGGAGCT
GCGCATTTAC TTGAATATGA CTCAGTTCAT GGTCGTTGGA ACAAAGCAAT AAGCAACGAC
CAAAACAACC TGAGTATTGA AGGTCAGCCA ATATCTTTTT CTCAAGAAAG TGATTACACA
AAAGTGCCAT GGAATGAAAA AGGTATAGAG CTAATTCTCG AATGCTCAGG AAAATTCAAA
ACTCCTCAAA CATTAAATCC TTATTTTGAT ACTCTTGGGA TGAAAAGAGT TGTCGTTGCA
TGTCCAGTAA AAGGATCCAT CCAGGGAGAG GATACTCTAA ATATCGTCTA CGGTATTAAT
CATGATTTAT ATGAGCCCAA TAAACATCGC TTAGTAACAG CTGCATCCTG CACAACTAAT
TGCTTAGCTC CCGTTGTGAA AGTTGTTAAT CAAGCTTTTG GTATAAAGCA TGGAAGCATC
ACAACACTTC ATGATTTAAC AAATACACAG GTAATTGTTG ATTCATTTAA ATCAGATTTA
AGAAGAGCAA GGAGCGGATC ACAAAGCTTA ATTCCAACAA CAACAGGATC AGCAAAAGCG
ATAGGGATGA TATTCCCAGA ATTACAAGGA AAATTAAATG GCCATGCAGT TCGAGTCCCT
CTCCTCAATG GATCTTTAAC TGATGCTGTA TTTGAATTAG AGAAAGAGGT CACGCAAGAA
GAAGTTAATC ATGTGTTCAA AGAAGCTTCA GAAGGAGAGC TAAAAGGAAT CCTTGGTTAC
GAAGAAAAAC CACTTGTCTC AATTGATTAT GTCAATGACT CAAGGAGTTC AATCATAGAT
GCGCCATCAA CCATGGTGAT CAATAAATCT CAATTGAAAG TCTATATTTG GTATGACAAT
GAATGGGGTT ATAGCTGTCG AATGGCAGAT CTCGTCTGCC ATGTCATAAA TCTTGAAAAG
GATTAA
 
Protein sequence
MRIGINGFGR IGRLVLRALW DRENIEISHI NDPFGDAKGA AHLLEYDSVH GRWNKAISND 
QNNLSIEGQP ISFSQESDYT KVPWNEKGIE LILECSGKFK TPQTLNPYFD TLGMKRVVVA
CPVKGSIQGE DTLNIVYGIN HDLYEPNKHR LVTAASCTTN CLAPVVKVVN QAFGIKHGSI
TTLHDLTNTQ VIVDSFKSDL RRARSGSQSL IPTTTGSAKA IGMIFPELQG KLNGHAVRVP
LLNGSLTDAV FELEKEVTQE EVNHVFKEAS EGELKGILGY EEKPLVSIDY VNDSRSSIID
APSTMVINKS QLKVYIWYDN EWGYSCRMAD LVCHVINLEK D