Gene NATL1_08201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08201 
Symbol 
ID4779987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp754079 
End bp755146 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content42% 
IMG OID640084095 
Productfructose-1,6-bisphosphate aldolase 
Protein accessionYP_001014643 
Protein GI124025527 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3588] Fructose-1,6-bisphosphate aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTCT CGTACTACGC AGAAGAACTA AAGAAGACCG CAAGTGCCAT AGCCCAGCCA 
GGCAAAGGGA TTCTTGCTGT TGACGAGTCA ACGAAAACAG TAGGTAAAAG GCTTGCTTCA
ATAGGTGTTG AGAACACTGA GGACAACAGA AAAGCATATA GAGGTATGCT TTTCACCACA
GAAGGTCTTG GAAACTTCAT AAGCGGAGCG ATTCTTTTTG AAGAAACTCT TTTCCAGAAC
CATCCAGATG GTGAGCCAAT GGTTAAAAAG CTTGAGAAGC TAGGCATAAT TCCAGGAATC
AAAGTTGACA AAGGTCTAAG ACCATTAGCC GGTGGACATG ATGTAGAAAC TTTTTGTTCA
GGTTTAGACG GTCTTGTTGA AAGAGCTGCT GATTATTACG AGCAAGGTGC AAGATTTGCC
AAGTGGAGAG CAGTACTTCA AATAACAGAC GATGGTTGTC CTTCTAAACT TTCTATTAGA
GAAAATGCTT GGGGTTTAGC AAGATACGCT AGATCAGTTC AAGAATCTGG CTTGGTTCCA
ATTATTGAAC CAGAAATCTT AATGGATGGT TCACATTCAA TTGAAAAGAC AGCAGCAGTT
CAAGAAGAAG TAATTAAAGA AGTTTACTTA GCTTGCCAGG TAAATGGAGT ACTTCTAGAG
GGAACTCTTC TGAAGCCATC AATGACTGTT CAAGGTGCTG ACAGCTCAAC AAAAGCTGAT
CCTCAGCAAG TAGCTGAAAT GACAATCCGT ACAATGGAAC GCTGTGTACC TGCAAGTGTC
CCTGGTATTA CTTTCCTTTC AGGTGGATTG AGCGAGGAAG CTGCATCAGT TTATCTAAAT
CTGATGAATA AGATCGACAG AAAGGCTAAG TGGAATGTTT CATTCTCATA TGGTCGTGCT
TTACAACATT CATGTCTAAA AGCATGGAAA GGCTCGAACA CTTCTGATGG ACAAAAAGCA
CTCATAGCTA GAGCTCAGGC AAACTCTGAG GCATCAAAAG GATTGTATGT TGCTGGTTCT
CAGCCTTCTT CTGATGAGCA ATTATTTGTA GCTGGATACA AGTACTAA
 
Protein sequence
MALSYYAEEL KKTASAIAQP GKGILAVDES TKTVGKRLAS IGVENTEDNR KAYRGMLFTT 
EGLGNFISGA ILFEETLFQN HPDGEPMVKK LEKLGIIPGI KVDKGLRPLA GGHDVETFCS
GLDGLVERAA DYYEQGARFA KWRAVLQITD DGCPSKLSIR ENAWGLARYA RSVQESGLVP
IIEPEILMDG SHSIEKTAAV QEEVIKEVYL ACQVNGVLLE GTLLKPSMTV QGADSSTKAD
PQQVAEMTIR TMERCVPASV PGITFLSGGL SEEAASVYLN LMNKIDRKAK WNVSFSYGRA
LQHSCLKAWK GSNTSDGQKA LIARAQANSE ASKGLYVAGS QPSSDEQLFV AGYKY