Gene A9601_18201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_18201 
SymboltktA 
ID4718557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1552173 
End bp1554179 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content38% 
IMG OID640079553 
Producttransketolase 
Protein accessionYP_001010210 
Protein GI123969352 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGCTG CATCTGTTTC ATTAGAATCA CTTTGTGTAA ATAGTATAAG AATGCTTGCT 
GTAGATGCAG TAAATAAATC TAATAGTGGT CATCCTGGAT TGCCAATGGG ATGTGCACCT
ATGGGTTATG CATTATGGCA AAACATACTT AATCACAACC CTAACAACCC TAAATGGTTC
AATAGAGACC GTTTTGTTTT ATCAGCTGGT CATGGCTGTA TGCTGTTGTA TTCCTTGCTT
CATTTGACAG GATATAAATC AGTTTCCATA GAAGATATTA AAGAATTTAG GCAATGGGGA
TCAAAAACTC CTGGACATCC AGAAACATTC GAAACTGAAG GTGTTGAAGT TACAGCTGGG
CCTCTTGGAG CAGGAATTTC AAATGCAGTT GGTTTAGCAA TAGCTGAAAC ACACTTAGCA
GCTAAATTTA ATAAGCCTGA TTGCAATATT GTTGATCACT ATACTTACGT AATAATGGGT
GATGGCTGTA ATCAAGAAGG TATCGCATCA GAGGCCTGCT CATTAGCTGG TCATCTTAAG
CTTGGGAAAT TAATTGCACT TTATGACGAT AATCAAATTA CAATTGATGG ACGGACCGAC
GTTTCTTTTA CTGAAGATGT CTTAAAAAGA TACGAAGCTT ATGGATGGCA TGTGCAACAT
GTTGAAGATG GGAATCATGA TGTTAAAGGA ATCACCGAAG CTATCGAAAA AGCGAAATTA
ATTACAGACA AGCCTTCAAT TATAAAAATT TCTACAACCA TAGGTTACGG TTCTCCAAAT
AAATCAGATA CTGCTGGAAT TCATGGAGCA GCTGTCGGAG AAGAAGAAGC TGCATTAACT
AGAGAGTTTC TAAACTGGGA ATATCCTCCT TTTAAAATAC CCGATGAAGT ATATACGCAT
TTTAGAAAAT CAATAAACAA AGGTGAAAAT TTAGAGCAAG AATGGGATTC TAAATTTGAA
GAATATCAAA AAAAATATCC CTCTGAAGGA GCCGAATTAA AAAGAATGTT AGAGGGTCAA
TTACCTGAGA ATTGGGACTC AGACCTCCCC TCTTATTCGC CTAATGATAA AGGTTTAGCC
ACAAGAAAGC ATTCACAAAT ATGTTTGGGT GCTCTAGGTC CTAACCTACC TGAATTAATT
GGCGGATCAG CAGATTTAAC TCACTCTAAT TACACAGATA TAAAAGGAGA AACTGGATCA
TTCCAGCCAC ATAGCCCTGA AAAAAGATAT TTACATTTTG GTGTACGAGA GCATGCAATG
GCAGCTGTAC TTAATGGTAT TGCCTATCAC AATAGTGGTC TTATCCCTTA TGGTGGAACC
TTCCTTGTTT TCGCCGATTA TATGAGGGGC TCAATGAGGC TTTCAGCACT TAGCGAATTA
GGAGTAATCT ATGTCTTAAC ACATGATTCA ATTGGTGTAG GAGAAGATGG GCCAACACAT
CAACCTATTG AGACTATCCC TTCTCTTCGC GCAATGCCTA ACATGCTAGT TTTCAGACCT
GGAGATGGCA ACGAGACGAG TGGGGCTTAT AAGCTTGCTA TTCAAAATCG AAAAAGACCT
TCTGCCCTTT GTTTAAGTAG ACAAGGTATG CCAAATCAAG AAAATACTTC GATAGACAAA
GTTGCTCTAG GAGGATATGT AGTTTCCGAT TGTGAAGGAA CACCAGACTT AATATTTATT
GGTACTGGAA GCGAACTGAA TCTTTGCATT GAAGCAAGTA AGGAACTTTC AAGCTTGGGT
AAAAAAATTA GAGTTGTCTC TATGCCTTGT GTAGAACTTT TTGAAGAGCA AGAAGAATCT
TATAAAGAAA GTGTTTTACC TAGTAGTGTG AAAAAGAGAG TTGTAGTAGA AGCAGCTCAT
TCATTTGGTT GGCATAAATA TACAGGTTTT GATGGTCTTT GTATCACTAT GGATAGGTTT
GGTGCATCAG CACCAGGTGG AGAATGTATG AAAAATTTTG GATTTACAGT AGAAAACGTA
GTTAATAAGA CTAAGGAAAT TCTATAA
 
Protein sequence
MVAASVSLES LCVNSIRMLA VDAVNKSNSG HPGLPMGCAP MGYALWQNIL NHNPNNPKWF 
NRDRFVLSAG HGCMLLYSLL HLTGYKSVSI EDIKEFRQWG SKTPGHPETF ETEGVEVTAG
PLGAGISNAV GLAIAETHLA AKFNKPDCNI VDHYTYVIMG DGCNQEGIAS EACSLAGHLK
LGKLIALYDD NQITIDGRTD VSFTEDVLKR YEAYGWHVQH VEDGNHDVKG ITEAIEKAKL
ITDKPSIIKI STTIGYGSPN KSDTAGIHGA AVGEEEAALT REFLNWEYPP FKIPDEVYTH
FRKSINKGEN LEQEWDSKFE EYQKKYPSEG AELKRMLEGQ LPENWDSDLP SYSPNDKGLA
TRKHSQICLG ALGPNLPELI GGSADLTHSN YTDIKGETGS FQPHSPEKRY LHFGVREHAM
AAVLNGIAYH NSGLIPYGGT FLVFADYMRG SMRLSALSEL GVIYVLTHDS IGVGEDGPTH
QPIETIPSLR AMPNMLVFRP GDGNETSGAY KLAIQNRKRP SALCLSRQGM PNQENTSIDK
VALGGYVVSD CEGTPDLIFI GTGSELNLCI EASKELSSLG KKIRVVSMPC VELFEEQEES
YKESVLPSSV KKRVVVEAAH SFGWHKYTGF DGLCITMDRF GASAPGGECM KNFGFTVENV
VNKTKEIL