Gene NATL1_00221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00221 
Symbolgap2 
ID4780780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp24726 
End bp25748 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content39% 
IMG OID640083285 
Productglyceraldehyde 3-phosphate dehydrogenase(NADP+)(phosphorylating) 
Protein accessionYP_001013851 
Protein GI124024735 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTGC GTGTAGCGAT TAATGGATTC GGAAGAATTG GACGCAATTT TATGCGTTGT 
TGGCTCAGTA GGGGTGCAAA TACAAATATT GAGGTGGTCG GCATTAACGT CACTTCTGAT
CCAAAAACTT GTGCACATTT GCTCAAATAT GACTCTATTT TAGGTGCTAT AAAAGACGCC
CAAATTTCAC ATACGGACGA TACATTTCAA ATTAATGGCA AAACTATAAA ATGTTATTCA
GATAGAAACC CTTTAAATCT TCCTTGGAAA GAGTGGGGAA TTGATTTAGT AATTGAGTCA
ACTGGTGTAT TCAATACAGA TGTTGGTGCT AGTAAGCATC TACAAGTTGG TGCTAAGAAG
GTCATTCTTA CTGCACCTGG GAAGGGTGAT GGTGTAGGTA CTTATGTGGT TGGTGTTAAC
GCTGATTCAT ACTCACATGA AGATTTTGAT ATCCTTAGCA ACGCAAGTTG CACCACTAAT
TGTTTAGCGC CAATCGTAAA AGTTTTAGAT CAAAAGTTGG GGATTAATAA AGGTTTAATG
ACCACGATTC ATAGTTATAC GGGAGATCAA AGAATTCTTG ACAATGCTCA TCGTGATTTA
CGTCGCGCAA GAGCAGCAGC AATGAATTTG GTCCCTACTT CAACTGGAGC GGCAAAGGCT
GTTGCTCTTG TTTATCCACA AATGAAAGGG AAACTAACTG GTATTGCGAT GCGAGTCCCT
ACTCCTAATG TTTCTGCGGT TGATTTGGTT TTTGAATCAG GACGTAAAAC TAGTGCTGAA
GAGGTCAATT CATTACTTAA AACCGCTTCA CAGGGAGAAA TGAAAGGAAT CATTAAATAT
GGTGATTTGC CTCTGGTTTC TACTGACTAT GCGGGAACGA ATGAATCAAC CATTGTTGAT
GAAGCATTAA CAATGTGCAT CGATGACAAT ATGGTGAAAG TTTTAGCTTG GTATGACAAT
GAGTGGGGTT ACAGTCAAAG GGTTGTTGAT TTGGCTGAAA TTGTTGCTCA GAAATGGAAG
TAA
 
Protein sequence
MTLRVAINGF GRIGRNFMRC WLSRGANTNI EVVGINVTSD PKTCAHLLKY DSILGAIKDA 
QISHTDDTFQ INGKTIKCYS DRNPLNLPWK EWGIDLVIES TGVFNTDVGA SKHLQVGAKK
VILTAPGKGD GVGTYVVGVN ADSYSHEDFD ILSNASCTTN CLAPIVKVLD QKLGINKGLM
TTIHSYTGDQ RILDNAHRDL RRARAAAMNL VPTSTGAAKA VALVYPQMKG KLTGIAMRVP
TPNVSAVDLV FESGRKTSAE EVNSLLKTAS QGEMKGIIKY GDLPLVSTDY AGTNESTIVD
EALTMCIDDN MVKVLAWYDN EWGYSQRVVD LAEIVAQKWK