Gene NATL1_20621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20621 
SymboltktA 
ID4780041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1706093 
End bp1708105 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content41% 
IMG OID640085358 
Producttransketolase 
Protein accessionYP_001015882 
Protein GI124026767 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCCT TGACTACTTC CCTAGACACA CTTTGCATCA ACAGCATCAG GATGCTCGCT 
GTTGATGCAA TTAATAAATC CAAAAGTGGT CACCCAGGTT TACCTATGGG TTGCGCACCT
ATGGGTTATG CATTATGGGA CAAGCACTTA CGACACAATC CAAAAAACCC GAAATGGTTT
AATCGAGACA GATTTGTTCT TTCAGCGGGA CATGGATGCA TGCTTTTGTA TGCTCTCTTG
CATTTGACTG GCTATGACTC TGTCACCATT GAAGATATAA AAGAATTTAG ACAGTGGGGA
GCTAAAACTC CAGGGCATCC AGAAACCTTC GAAACTCCGG GAGTTGAAGT AACTGCAGGA
CCTTTGGGAG CAGGAATTTC AAATGCAGTT GGATTAGCTA TTGCGGAAGC TCACCTTGCT
GCAAAATTCA ACAAACCTGA TTCAACAGTT GTAGACCATT ACACCTATGT GATCATGGGG
GATGGATGTA ATCAAGAAGG TATTTCCTCA GAAGCATGTT CTCTTGCTGG GCATTTAAAA
CTTGGAAAAT TAATAGCTCT CTATGACGAT AATCACATAA CTATTGATGG AAGAACTGAT
GTCTCATTTA CAGAGGATGT CTTAAAAAGA TATGAAGCAT ATGGATGGCA TGTACAAGAA
ATTCCTGAGG GGAATACAGA TGTTGAAGGC ATATCTCAAG CAATCGAAAA GGCCAAATCA
GTCACTGATA AGCCATCCAT CATCAAAGTA ACAACAACTA TCGGTTACGG TTCTCCAAAT
AAAAGTGATA CTGCAGGTAT TCACGGCGCC CCATTGGGCG AAGAAGAGGC AGAGCTCACT
AGAAAACAAT TAGGTTGGTC ATACAAACCT TTCGAGGTTC CCCAAGATGC TTATGATCAA
TATCGACAAG CCATTCAAAA AGGTGCACAG CTAGAAGAAG AGTGGAATCA AAGCCTAGCT
AAATACAAAG AAAAATATCC TAATGAAGCA ACTCAATTTG AGCGCATGTT GAGAGGTGAG
CTCCCTGAAG GCTGGGATAA AGATTTACCT ACCTATACAT CCGATGATAA AGGGGTTGCT
ACTAGAAAAC ATTCTCAAAT ATGTCTAGGT GCTCTTGGTC CAAACATCCC TGAACTAATA
GGAGGTTCTG CTGATTTAAC TCATTCAAAC TACACAGATA TAAAAGGAGA AACTGGATCT
TTTCAATATG AAAGTCCTGA AAAACGTTAT TTACATTTTG GTGTCAGAGA GCATGCTATG
GCAGCCATAT TGAATGGCAT TGCTTATCAC GACAGTGGTT TAATTCCTTA TGGTGGAACC
TTCTTAGTCT TCGCAGATTA CATGAGAGGA TCGATGCGTC TTTCTGCTCT TAGTGAGCTT
GGAGTTATTT ATGTTTTAAC CCATGATTCC ATAGGTGTTG GTGAAGATGG CCCAACACAT
CAACCTATAG AAACCATCCC ATCATTGAGA GCAATGCCAA ATATGATGGT TTTCCGTCCT
GGCGATGGCA ATGAAACCAG TGGTGCTTAT AAAGTTGCAA TTAAAAATCG TAAGAGACCA
AGTTCCTTAT GCCTAAGTAG GCAGGGTATG GCAAATCAAC AAAATTCATC CGTAGACAAA
GTTGCTTTAG GTGGATATGT ACTTGAGGAG TGCGATGGCA CCCCAGAACT AATACTTATC
GGAACCGGAA CTGAACTTGA TTTATGTGTT CAAGCAGCGA AAAAGTTAAC TAAGGAAGGT
CGAAAAGTGC GTGTTGTTTC TATGCCATGC GTTGAACTTT TTGAAGAACA AAGCGATAGT
TATAAAGAAG AAGTTTTGCC TTCAAATATC AGAAAACGCC TAGTAGTTGA AGCCGCAGAG
AGTTTCGGAT GGCACAAATA TATTGGTCTT GATGGTGACA GCGTAACTAT GAATAGCTTT
GGAGCATCTG CTCCAGGTGG ATTATGTATG GAAAAATTTG GATTTACAGT TGAAAACGTA
CTAGAAAAAT CTAAAAGTCT GCTCAACAAA TAA
 
Protein sequence
MVALTTSLDT LCINSIRMLA VDAINKSKSG HPGLPMGCAP MGYALWDKHL RHNPKNPKWF 
NRDRFVLSAG HGCMLLYALL HLTGYDSVTI EDIKEFRQWG AKTPGHPETF ETPGVEVTAG
PLGAGISNAV GLAIAEAHLA AKFNKPDSTV VDHYTYVIMG DGCNQEGISS EACSLAGHLK
LGKLIALYDD NHITIDGRTD VSFTEDVLKR YEAYGWHVQE IPEGNTDVEG ISQAIEKAKS
VTDKPSIIKV TTTIGYGSPN KSDTAGIHGA PLGEEEAELT RKQLGWSYKP FEVPQDAYDQ
YRQAIQKGAQ LEEEWNQSLA KYKEKYPNEA TQFERMLRGE LPEGWDKDLP TYTSDDKGVA
TRKHSQICLG ALGPNIPELI GGSADLTHSN YTDIKGETGS FQYESPEKRY LHFGVREHAM
AAILNGIAYH DSGLIPYGGT FLVFADYMRG SMRLSALSEL GVIYVLTHDS IGVGEDGPTH
QPIETIPSLR AMPNMMVFRP GDGNETSGAY KVAIKNRKRP SSLCLSRQGM ANQQNSSVDK
VALGGYVLEE CDGTPELILI GTGTELDLCV QAAKKLTKEG RKVRVVSMPC VELFEEQSDS
YKEEVLPSNI RKRLVVEAAE SFGWHKYIGL DGDSVTMNSF GASAPGGLCM EKFGFTVENV
LEKSKSLLNK