Gene Spro_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_0284 
Symbol 
ID5607051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp325689 
End bp327632 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content58% 
IMG OID640935783 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001476522 
Protein GI157368533 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00835607 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAACG TTAACAAACC CCGCGCTCGT AAAGAGCAAC GCGAAGAAGC CCAGCAGTTT 
ATCAACACCC TGCAAGGCGT TTCCTTCCCC AATTCACGCC GTATCTATCT GCAGGGCTCA
CGTAGTGATC TGCAGGTGCC AATGCGTGAA ATTCAGCTCA GCCCGACACT GATCGGTGGC
GATAAGGACA ACCCGAAGTA CGAGCCAAAC GAAGCGATTC CGGTGTACGA CACTGCCGGC
CCTTACGGTG ATCCCCAATC AGAACTCAAC GTGCATAGCG GCCTGGCGAA GCTACGCGCC
GGTTGGATCG CTGAACGCGG TGATACCGAA GCGCTCAAGG GTGTCAGTTC AGGTTTCACC
CAGCAGCGCC TGGCCGATGA AGGCCTGGAT CACCTGCGCT TTGAACATTT GCCGTTGCCG
CGCAAGGCAC AACCGGGCCA ATGCGTGACC CAGTTGCACT ACGCGCGTGC TGGCACCGTC
ACCCCAGAAA TGGAATTTAT CGCCATTCGC GAAAACATGG GGCGCGAGCG CATCCGTGGC
GAGGTATTAC GCCACCAGCA TCCTGGCCAA AGTTGGGGGG CCAACCTGCC GGAGAACATC
ACGCCGGAAT TCGTTCGCCA GGAAGTCGCC GCCGGGCGCG CCATCATCCC CGCCAATATC
AACCACCCGG AATCCGAACC GATGATCATC GGCCGTAACT TCCTGGTGAA GGTTAACGCC
AACATCGGCA ACTCGGCGGT GACCTCCTCG ATCGAGGAAG AGGTCGAGAA ACTGGTGTGG
TCCACCCGCT GGGGCGCGGA TACCGTGATG GATCTGTCTA CCGGCCGCTA TATCCACGAA
ACCCGCGAAT GGATCCTGCG TAACAGCCCG GTGCCAATTG GCACGGTGCC TATCTATCAG
GCGCTGGAAA AGGTGAACGG CGTGGCGGAA AACCTCACCT GGGAAATGTT CCGCGATACG
CTGCTGGAGC AGGCGGAACA AGGGGTTGAC TACTTCACCA TTCACGCCGG CGTGCTGCTG
CGCTATGTGC CGATGACCGC CAAACGCCTG ACCGGCATCG TTTCCCGCGG CGGTTCGATC
ATGGCCAAAT GGTGCCTGTC ACATCATCAG GAAAACTTCC TGTACCAGCA CTTCCGCGAA
ATCTGTGAGA TTTGCGCCGC CTATGACGTT TCTCTTTCCC TCGGCGACGG CCTGCGGCCA
GGCTCAATTC AGGACGCCAA CGACGAGGCA CAGTTCGCCG AACTGCACAC CTTGGGTGAG
TTGACCAAAA TTGCCTGGGA ATATGATGTG CAGGTAATGA TCGAAGGCCC TGGCCACGTG
CCGATGCAGA TGATCCGCCG CAACATGACC GAGGAGCTGG AGCACTGCCA CGAAGCACCA
TTCTATACTC TTGGCCCACT GACCACCGAT ATCGCTCCGG GTTATGACCA CTTTACCTCC
GGCATCGGGG CCGCGATGAT CGGCTGGTTC GGCTGCGCCA TGCTGTGTTA CGTCACGCCA
AAGGAGCATC TCGGGCTGCC GAACAAAGAG GACGTCAAGC AGGGGCTGAT CACCTACAAA
ATTGCCGCTC ACGCCGCAGA CCTGGCAAAA GGCCACCCCG GTGCGCAAAT CCGCGATAAC
GCCATGTCCA AGGCGCGCTT CGAATTCCGC TGGGAGGATC AGTTCAATCT GGCGCTCGAT
CCAGCCACTG CTCGCGCTTA TCACGACGAA ACCCTGCCGC AAGAGTCCGG CAAAATCGCC
CACTTCTGCT CTATGTGCGG GCCAAAATTC TGCTCGATGA AAATTTCGCA GGAAGTTCGC
GACTATGCCG CCGCGCAGGA AGCGGCCAAA CCGATAGCAG TGCAGTTAAC CGGCATGGAA
AAGATGTCGG CCGAGTTCCG CTCACGCGGC AGCGAGCTGT ACCACAGCGC CGGTAACCTG
CAAGAGGAAT TAAACAATGA CTGA
 
Protein sequence
MSNVNKPRAR KEQREEAQQF INTLQGVSFP NSRRIYLQGS RSDLQVPMRE IQLSPTLIGG 
DKDNPKYEPN EAIPVYDTAG PYGDPQSELN VHSGLAKLRA GWIAERGDTE ALKGVSSGFT
QQRLADEGLD HLRFEHLPLP RKAQPGQCVT QLHYARAGTV TPEMEFIAIR ENMGRERIRG
EVLRHQHPGQ SWGANLPENI TPEFVRQEVA AGRAIIPANI NHPESEPMII GRNFLVKVNA
NIGNSAVTSS IEEEVEKLVW STRWGADTVM DLSTGRYIHE TREWILRNSP VPIGTVPIYQ
ALEKVNGVAE NLTWEMFRDT LLEQAEQGVD YFTIHAGVLL RYVPMTAKRL TGIVSRGGSI
MAKWCLSHHQ ENFLYQHFRE ICEICAAYDV SLSLGDGLRP GSIQDANDEA QFAELHTLGE
LTKIAWEYDV QVMIEGPGHV PMQMIRRNMT EELEHCHEAP FYTLGPLTTD IAPGYDHFTS
GIGAAMIGWF GCAMLCYVTP KEHLGLPNKE DVKQGLITYK IAAHAADLAK GHPGAQIRDN
AMSKARFEFR WEDQFNLALD PATARAYHDE TLPQESGKIA HFCSMCGPKF CSMKISQEVR
DYAAAQEAAK PIAVQLTGME KMSAEFRSRG SELYHSAGNL QEELNND