Gene Ssed_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_2013 
Symbol 
ID5614092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2436682 
End bp2438640 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content49% 
IMG OID640932899 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001473750 
Protein GI157375150 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.715705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.547634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAC GTCGTGAAAC CCGAGCTCAA GCTCAACAAT TTATTGATAA TCTAAAACCA 
CTTCAACATC CAAATTCTGA GAAAGTTTAC TTAGTCGGTA GCCGTGATGA TATTCGGGTA
GGTATGCGTC AAATACACCA ATCAGAGACC ATGATTGGCG GCACCGAATC TCATCCCGTT
TTAGAATCAA ATCCGCCATT GAAAGTGTAC GATTGTGCCG GTCCATATTC CGATCCCAAC
GCCAAGATTA ACGTCCGTGA AGGTCTGGAT AAGTTCAGGG CAAATTGGAT CCTTGAGCGC
AATGATACCG AGCAATTGAT CGCTGCCAGT TCTGGTTTTA CACAGCAAAG GTTAGCCGAT
TACGGACTCG ACCATCTGCG TTTCGACTCG CTTCTTTCAC CCAGAAAGGC CAAACAAGGT
CAGTGTGTGA CTCAACTTCA TTATGCCAGA CAGGGGATAG TTACCCCGGA AATGGAGTAC
ATTGCCATCA GAGAGAATAT GGCTTTGAGT GAAGTGACCG ATGAAGCGTT AACTCAGAAA
GCAGAAGGTG AAAGTTTCGG CGCGGCGATA AGTCAGCCTA TTACCCCTGA GTTTGTCCGG
CAAGAGGTGG CACGGGGACG TGCCATTATT CCGCTGAATA TTAACCATCC CGAAGCCGAA
CCTATGATTA TCGGTCGTAA CTTTCTGGTT AAAGTGAACG CCAATATAGG TAACTCTGCG
GTGACCTCTT CAATCGAAGA GGAAGTTGAA AAACTGGTTT GGTCGACGCG TTGGGGAGCC
GATACGGTAA TGGATCTATC GACGGGTCGA TATATTCATG AAACAAGGGA ATGGATCATA
CGAAACTCGC CAGTGCCTAT TGGTACCGTG CCCATCTACC AAGCTCTGGA GAAGGTCAAT
GGCGTGGCGG AAGACCTCAC ATGGGAGATA TTCAGAGATA CTCTGCTTGA ACAGGCAGAG
CAAGGCGTCG ATTACTTTAC TATCCACGCA GGTGTGTTGC TTCGCTACGT CCCGATGACG
GCCAAGCGTT TGACCGGCAT CGTCTCACGG GGCGGATCAA TTATGGCAAA GTGGTGTCTC
TCTCACCATC TTGAAAACTT CTTGTATGAA CACTTCCGGG ATATCTGTGA GCTTTGCGCC
GCCTATGACG TTTCATTGTC ACTCGGTGAT GGAATGCGAC CGGGATCAAT TGCCGATGCT
AACGATGAAG CACAATTTTC TGAACTCGAA ACCTTAGGTG AGTTAGTTAA GATTGCCTGG
GAGTACGATG TGCAAACGAT CATCGAGGGG CCGGGTCATA TCCCCATGAA TCTGATTAAA
GAGAACATGG ATAAGCAGCT TGAGGTGTGT GATGAAGCGC CATTTTATAC CTTAGGCCCT
CAGACTACCG ATATCGCACC GGGTTATGAT CATTTCACTT CGGGGATCGG CGCGGCAATG
ATTGCCTGGT ATGGCTGTGC CATGCTCTGC TATGTGACGC CAAAAGAGCA CTTAGGTTTA
CCGAACAAGG AAGACGTCAA GCAAGGCTTG ATCACCTATA AGATTGCCGC TCATGCCGGT
GATGTGGCTA AGGGTCATCC GACAGCACAA ATTCGAGACA ATGCCCTGTC CAAGGCAAGG
TTTGAGTTTC GCTGGGAAGA TCAATATAAC TTAGGACTGG ATCCTGACAC CGCCAGAGCC
TATCACGATG AATCGCTTCC GCAGGAATCG GCGAAAGTTG CTCATTTCTG CTCCATGTGC
GGGCCCAAGT TCTGTTCGAT GAAGATAAGC CAAGAGGTCA GAGAATATGC GGCGGCGCAA
GAGGTTAAGC TGCATACAGA CACGGAGTTT AAGGCTAAGA GTGTCAAAGA GTCGGGTATG
GCTCAGATGT CTGCAGAATT TAAGGCTAAG GGGGCGGCGC TTTACCATGA AAGCGGCGCG
TTGGTGGAAG ATGTTGAACT TGTAGAAACC GAGGGCTAG
 
Protein sequence
MSTRRETRAQ AQQFIDNLKP LQHPNSEKVY LVGSRDDIRV GMRQIHQSET MIGGTESHPV 
LESNPPLKVY DCAGPYSDPN AKINVREGLD KFRANWILER NDTEQLIAAS SGFTQQRLAD
YGLDHLRFDS LLSPRKAKQG QCVTQLHYAR QGIVTPEMEY IAIRENMALS EVTDEALTQK
AEGESFGAAI SQPITPEFVR QEVARGRAII PLNINHPEAE PMIIGRNFLV KVNANIGNSA
VTSSIEEEVE KLVWSTRWGA DTVMDLSTGR YIHETREWII RNSPVPIGTV PIYQALEKVN
GVAEDLTWEI FRDTLLEQAE QGVDYFTIHA GVLLRYVPMT AKRLTGIVSR GGSIMAKWCL
SHHLENFLYE HFRDICELCA AYDVSLSLGD GMRPGSIADA NDEAQFSELE TLGELVKIAW
EYDVQTIIEG PGHIPMNLIK ENMDKQLEVC DEAPFYTLGP QTTDIAPGYD HFTSGIGAAM
IAWYGCAMLC YVTPKEHLGL PNKEDVKQGL ITYKIAAHAG DVAKGHPTAQ IRDNALSKAR
FEFRWEDQYN LGLDPDTARA YHDESLPQES AKVAHFCSMC GPKFCSMKIS QEVREYAAAQ
EVKLHTDTEF KAKSVKESGM AQMSAEFKAK GAALYHESGA LVEDVELVET EG