Gene SAG1372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1372 
SymbolthiI 
ID1014181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1381150 
End bp1382364 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content37% 
IMG OID637316548 
Productthiamine biosynthesis protein ThiI 
Protein accessionNP_688370 
Protein GI22537519 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000114319 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTATT CAGAAATTAT GATTCGTTAT GGAGAACTCT CTACTAAGAA GAAAAACCGT 
ATGCGCTTCA TCAATAAGTT AAAAAATAAT ATGGAGCATG TACTCTCCAT TTATCCAGAT
GTTTCAGTAA AAACAGATCG TGATAGAGGA CATGTATATC TCAATGGTAC AGATTATCAT
GAAGTTGCAG AGTCCTTAAA AGAGATTTTT GGTATCCAAG CTTTTTCTCC ATCTTTTAAA
GTAGAAAAAA ATGTTGATAC ATTGGTAAAA GCTGTCCAGG AAATTATGAC TTCCGTTTAT
AAAGATGGGA TGACTTTTAA AATTACCGCA AAACGTAGTG ACCACTCATT TGAATTGGAT
AGCCGTGCTC TAAATCATAC TTTAGGAGAT GCTGTTTTTT CAGTCTTGCC AAATATTAAG
GCTCAGATGA AGCAACCAGA TATCAATCTT AAAGTCGAGA TACGAGATGA GGCTGCTTAT
ATTTCATATG AGGATATTAG GGGTGCAGGA GGATTACCAG TAGGAACATC TGGAAAAGGG
ATGCTGATGT TGTCTGGTGG GATTGATTCT CCGGTAGCAG GTTACCTAGC GTTAAAACGT
GGTGTAGATA TAGAAGCAGT CCATTTTGCA AGTCCTCCTT ATACTAGCCC AGGTGCATTG
AAAAAAGCAC ATGATTTAAC ACGTAAATTG ACAAAATTTG GTGGTAATAT TCAATTTATT
GAAGTTCCAT TCACAGAAAT TCAAGAGGAA ATTAAGGCAA AAGCTCCCGA AGCCTACTTG
ATGACGTTAA CACGTAGGTT TATGATGCGT ATTACAGATC GTATTCGTGA GGACCGAAAT
GGTCTTGTTA TTATTAACGG TGAAAGTTTA GGACAGGTGG CAAGCCAAAC GTTAGAAAGT
ATGCAAGCTA TTAATGCTGT CACTGCAACA CCGATTATTC GTCCTGTGGT CACGATGGAT
AAGCTAGAAA TTATTGATAT TGCTCAAAAA ATAGATACTT TTGATATTTC AATTCAACCA
TTTGAGGATT GCTGTACGAT TTTTGCACCA GATCGCCCAA AAACTAACCC TAAAATTAAG
AATACAGAAC AGTATGAGAA ACGTATGGAT GTAGAAGGTC TTGTAGAGAG GGCAGTTGCA
GGGATTATGG TAACTACTAT TCAACCTCAA GCAGATAGTG ATGATGTTGA TGACTTGATT
GACGATTTAT TATAA
 
Protein sequence
MQYSEIMIRY GELSTKKKNR MRFINKLKNN MEHVLSIYPD VSVKTDRDRG HVYLNGTDYH 
EVAESLKEIF GIQAFSPSFK VEKNVDTLVK AVQEIMTSVY KDGMTFKITA KRSDHSFELD
SRALNHTLGD AVFSVLPNIK AQMKQPDINL KVEIRDEAAY ISYEDIRGAG GLPVGTSGKG
MLMLSGGIDS PVAGYLALKR GVDIEAVHFA SPPYTSPGAL KKAHDLTRKL TKFGGNIQFI
EVPFTEIQEE IKAKAPEAYL MTLTRRFMMR ITDRIREDRN GLVIINGESL GQVASQTLES
MQAINAVTAT PIIRPVVTMD KLEIIDIAQK IDTFDISIQP FEDCCTIFAP DRPKTNPKIK
NTEQYEKRMD VEGLVERAVA GIMVTTIQPQ ADSDDVDDLI DDLL