Gene NATL1_16941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_16941 
SymbolthiE 
ID4779849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1380541 
End bp1381593 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content36% 
IMG OID640084978 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001015514 
Protein GI124026399 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAA TACCTGTCAC CCCTCCTTCT GATAATCGTA TTGCTCAATT AATTGACGCG 
AACCTTGATC GCGCTAGAGA GGGGCTTAGA GTTATGGAAG ATTGGTGCAG ATTTGGTTTA
AAGAGGAGTG ATTTTTCGAT TCAAATCAAA GATTGGAGGC AACAATTAGG AGTACATCAC
CACAATATTT ATCGAAAAGA AAGGCTTACA TCCATCGATC CAGCTATGGG CATTTCACAT
CCGTTACAAA CAGTCAGATC AACCCCAGAG GATGTATTTA TTGCAAACTC ATCCAGAGTT
CAAGAAGCCC TAAGAGTAAT AGAGGAATTC ACTCGAAAAA CAGATCCGAA TCTTTGTGAA
ATAGCCAGCA AAATTAGATA CGAAACTTAT GAAATCGAGA TAAAGGTGCT TAAATCGACA
GAAGGCGTAA ATAAAAGAGA AACCTTAAAA TATTGTTCGG TATACTTAAT AACCTCAAAC
AGGAGAGATA TAGAAGAGGT TGTTCTTCAT GCTCTAAAAG CTGGTGTAAA AATAGTTCAA
TATAGAGAGA AATTTTTAAA CGATAATGAA AAAATTTCAC AAGCTAAATG TTTAGCCTCT
CTTTGTAAAA AATTCAATTC ACTTTTTATA GTCAATGACC GTATTGATAT CGCACTCGCC
GTTGACGCAG ATGGAATTCA TTTGGGACAA GAAGATATGC CAACAAAAAT CGCGAGACAA
CTACTAGGGG CTGAAAAAAT CATTGGCAGA AGCACGCACT GTCTTGAAGA CATCAAAAAT
GCCGAAGGAG AAGGCTGTGA TTATATTGGT ATAGGGCCAA TATTTCCTTC TGAAACAAAA
AAGCAACTAA ATCCAATTGG AATTGACAAC CTAAAAAAAG GATTAAGTGA AACTATCCTT
CCTGCTTTTG CTATTGGGGG AATTAATAAA TCAAATATCA CAAAATTAAA TCAGATAAAT
AATCTTCGCA TTGCTGTCTC AAATGCAATC ATTAATTCAA ATGATCCCTT TTCAACAACT
GAAGAGCTTA TCAAATTTCT AATATGCAAT TAA
 
Protein sequence
MKSIPVTPPS DNRIAQLIDA NLDRAREGLR VMEDWCRFGL KRSDFSIQIK DWRQQLGVHH 
HNIYRKERLT SIDPAMGISH PLQTVRSTPE DVFIANSSRV QEALRVIEEF TRKTDPNLCE
IASKIRYETY EIEIKVLKST EGVNKRETLK YCSVYLITSN RRDIEEVVLH ALKAGVKIVQ
YREKFLNDNE KISQAKCLAS LCKKFNSLFI VNDRIDIALA VDADGIHLGQ EDMPTKIARQ
LLGAEKIIGR STHCLEDIKN AEGEGCDYIG IGPIFPSETK KQLNPIGIDN LKKGLSETIL
PAFAIGGINK SNITKLNQIN NLRIAVSNAI INSNDPFSTT EELIKFLICN