Gene SeAg_B4407 details

Gene Information

Locus tag: SeAg_B4407
Symbol: thiC
ID: 6793012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4298680
End bp: 4300575
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 58%
IMG OID: 642778502
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002149072
Protein GI: 197250591
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
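
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions that happen to agree with the numbers listed here, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4298680
end_bp = 4300575

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.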


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.679063
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTACAA CAACGTTAAC CCGCCGCGAG CAGCGCGCTA AAGCCCAGCA TTTTATCGAT 
ACGCTGGAAG GGACTGCCTT TCCCAACTCG AAACGCATCT ACGTGACCGG TTCGCAGCAT
GATATTCGCG TACCGATGCG CGAAATTCAA CTTAGCCCAA CGCTCATCGG CGGCAGCAAA
GACAACCCGC AGTTTGAAGA GAACGAAGCC GTGCCGGTGT ACGACACCTC CGGCCCCTAT
GGCGATCCTG AGGTGGCGAT TAACGTCCAG CAGGGTCTGG CAAAGCTACG CCAGCCGTGG
ATTGAAGCGC GGGCTGATGT AGAAACGCTT GCTGACCGCA GTTCTGCCTA TACCCGCGAA
CGCTTGACAG ATGAAGGGCT TGACGCATTA CGCTTTACCG GTCTGTTAAC GCCAAAACGC
GCCAAAGCCG GACACCGCGT GACGCAACTT CATTATGCCC GCCAGGGGAT CGTCACTCCC
GAAATGGAGT TCATCGCCAT CCGTGAAAAT ATGGGCCGCG AGCGCATTCG CAGTGAAGTA
CTGCGCCACC AGCATCCGGG GATGAGCTTT GGCGCGCGCC TGCCGGAAAA CATTACCCCG
GAATTCGTGC GTGATGAAGT CGCCGCGGGC CGCGCGATTA TTCCCGCCAA CATCAACCAC
CCGGAATCCG AGCCGATGAT TATCGGCCGC AACTTCCTGG TCAAAGTGAA TGCCAACATC
GGCAACTCGG CGGTGACCTC CTCCATCGAA GAAGAGGTGG AAAAACTGGT GTGGTCGACC
CGCTGGGGCG CGGACACGGT AATGGACCTT TCCACCGGCC GCTATATTCA CGAAACCCGT
GAGTGGATCC TGCGTAACAG CCCGGTACCG ATCGGCACCG TCCCGATCTA CCAGGCGCTG
GAAAAGGTCA ACGGGATCGC CGAAGATCTT ACCTGGGAAG CGTTCCGCGA CACGCTGCTG
GAGCAGGCCG AACAGGGCGT CGACTACTTC ACCATCCACG CGGGCGTGCT GCTGCGCTAC
GTGCCGATGA CCGCCAAGCG CCTGACCGGC ATTGTCTCAC GCGGCGGTTC AATCATGGCG
AAATGGTGCC TTTCTCATCA CAAAGAGAAC TTCCTGTTCG AACATTTCCG CGAGATTTGT
GAAATCTGCG CCGCCTACGA CGTTTCCCTG TCGCTGGGCG ACGGCCTGCG TCCCGGCTCC
ATTCAGGACG CCAACGACGA AGCGCAGTTC TCCGAGCTGC ATACGCTGGG CGAATTGACC
AAAATCGCCT GGGAATACGA CGTGCAGGTG ATGATTGAAG GCCCGGGCCA CGTACCCATG
CATATGATCC AGCGCAATAT GACCGAAGAG CTGGAGAGCT GCCATGAAGC ACCGTTCTAC
ACCTTAGGGC CATTGACCAC CGATATCGCG CCGGGCTATG ACCACTTCAC CTCCGGGATC
GGTGCCGCGA TGATCGGCTG GTTTGGCTGC GCGATGCTGT GTTATGTGAC GCCGAAAGAA
CACCTCGGCC TGCCGAACAA AGAAGATGTG AAGCAGGGGC TAATCACCTA CAAAATCGCC
GCCCACGCCG CGGATTTAGC CAAAGGACAT CCGGGCGCAC AGATCCGCGA TAACGCCATG
TCGAAAGCGC GCTTTGAATT CCGCTGGGAA GATCAGTTTA ACCTCGCGCT CGACCCGTTC
ACCGCCCGCG CTTATCACGA TGAAACCCTG CCGCAGGAGT CCGGTAAGGT CGCTCACTTC
TGTTCCATGT GCGGGCCGAA GTTCTGCTCG ATGAAAATCA GCCAGGAAGT CCGTGACTAC
GCCGCCGCGC AAACCATTGA AGTGGGGATG GCGAATATGT CGGAAAACTT CCGCGCCAAA
GGCGGCGAAA TTTATCTCAA GCGGGAGGAA GTCTGA
 
Protein sequence
MSTTTLTRRE QRAKAQHFID TLEGTAFPNS KRIYVTGSQH DIRVPMREIQ LSPTLIGGSK 
DNPQFEENEA VPVYDTSGPY GDPEVAINVQ QGLAKLRQPW IEARADVETL ADRSSAYTRE
RLTDEGLDAL RFTGLLTPKR AKAGHRVTQL HYARQGIVTP EMEFIAIREN MGRERIRSEV
LRHQHPGMSF GARLPENITP EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI
GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL
EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA
KWCLSHHKEN FLFEHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF SELHTLGELT
KIAWEYDVQV MIEGPGHVPM HMIQRNMTEE LESCHEAPFY TLGPLTTDIA PGYDHFTSGI
GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM
SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY
AAAQTIEVGM ANMSENFRAK GGEIYLKREE V
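
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the tool that produced this record: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments and differs mainly in the start codons it permits) and translates only the first and last rows of the gene sequence. The function and variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence in this record.
first_bases = "ATGTCTACAA CAACGTTAAC CCGCCGCGAG"
last_bases = "GGCGGCGAAA TTTATCTCAA GCGGGAGGAA GTCTGA"

print(translate(first_bases))  # compare with the start of the protein sequence
print(translate(last_bases))   # compare with the end of the protein sequence
```

The two translated fragments correspond to the first ten and last eleven residues of the protein sequence listed above.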