Gene Sde_0369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0369 
Symbol 
ID3967616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp456219 
End bp458135 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content50% 
IMG OID637919432 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_525845 
Protein GI90020018 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACCG ATACATTAGA AAAACCACGC TTGAGTGATA CCGCACAGGT AGACAGTCAA 
TCCATTGCGC CGTTTCCAAA TTCTAAAAAG ATCTACGTGC AGGGTAGTCG CCCAGATATT
CGCGTACCTA TGCGCGAAAT TAATTTATCT ATAACGCCAA CGGAATTTGG TGGCGAACAG
AATCCACCCG TGCGTGTTTA CGACACTTCT GGTGTGTACA CCGACCCCAA TGTAAAAATA
GATGTGCGCC AAGGTTTGCC CGATGTGCGC AGCGCTTGGA TAGCCGAGCG CGGTGACACC
GAAGTGCTGC AACAAAAAAG TTCGTCTTTT ACCCAGCAGC GCTTACACGA TGCAAGCTTG
GATACCTTGC GTTTTAATCA CCAGCGCCAG CCCCTTAAAG CCAAGCCGCG CGCAAACGTA
ACGCAAATGC ACTACGCGCG CTGCGGCATT ATTACCCCAG AAATGGAATA TATTGCCATT
CGCGAAAATA TGAGCTGGCA GCAAGCCAAA GAGCAAGGCG TGTTAGATCA GCAGCATGCC
GGCGAGCATT TTGGCGCAAA CATCCCAGAT GAAATTACAC CAGAATTTGT GCGCTCTGAA
GTGGCCTGCG GCCGCGCAAT TATTCCTGCA AATATTAACC ACCCCGAACT AGAGCCAATG
ATTATTGGCC GCAACTTTTT AGTAAAAATT AACGGCAATA TCGGCAACAG TGCGGTTACC
TCATCTATTG AAGAAGAAGT GGCGAAGTTA ACCTGGGGCA CGCGCTGGGG TGCCGATACC
ATTATGGATC TGTCCACCGG TAAAAATATT CACGAAACGC GCGAGTGGAT TATTCGCAAC
TCGTCAGTGC CCATTGGTAC AGTACCTATT TACCAAGCTT TAGAAAAAGT AGATGGCGTA
GCCGAAGATC TAACGTGGGA GATTTTCCGC GATACCCTCA TCGAGCAAGC AGAGCAAGGG
GTTGACTACT TCACAATCCA CGCCGGTGTA CTGTTGCGCT ATGTGCCGCT TACCGCTAAA
CGGGTAACAG GTATTGTGTC GCGCGGCGGC TCGATTATGG CTAAATGGTG CTTGGCGCAT
CACCGCGAAA ACTTTTTATA CACCCATTTC GAAGACATTT GCGAAATTAT GAAAGCTTAC
GATGTGAGCT TTTCTTTGGG GGATGGCTTG CGCCCAGGCT CCATTGCCGA CGCCAACGAC
GAAGCGCAAT TCGGCGAGCT AGAAACACTG GGCGAGCTTA CCAAAATTGC GTGGAAACAC
GATGTGCAGG TAATGATTGA AGGCCCAGGC CACGTACCAA TGCACATGAT CAAAGAAAAC
ATGGATAAGC AATTGCGCGA ATGTGGTGAA GCGCCGTTTT ATACCTTGGG GCCGCTGACT
ACCGATATCG CCCCAGGCTA CGACCATATT ACCTCGGGTA TTGGTGCGGC CATGATTGGC
TGGTACGGTT GTGCCATGCT TTGTTACGTT ACACCCAAAG AGCATTTGGG TTTACCCAAC
AAAGACGATG TAAAAGAGGG TATCATCACT TACAAAATTG CTGCCCACGC AGCGGATTTG
GCCAAAGGGC ACCCCGGCGC ACAGCTGCGT GACAACGCAC TCTCTAAGGC GCGCTTTGAA
TTCCGTTGGG AAGATCAGTT TAATTTGGGC TTAGACCCAG ATACTGCGCG GTCTTATCAC
GACGAAACGC TGCCAAAAGA TTCCGCTAAA GTTGCGCACT TTTGCTCTAT GTGTGGCCCC
AAGTTCTGCT CGATGAAAAT CACCCAAGAG GTGCGCGATT ACGCAGCAGA ACACGGTACA
GATATTACAC CAATCGCCGA AGATGAAGTG GTACGAATGA TTGATGTAGA AGCCGAAATG
CGCAAGAAGT CGGAGGAGTT CCGCGAGAAG GGCAGTGAGA TATATGGGAA AATCTAG
 
Protein sequence
MTTDTLEKPR LSDTAQVDSQ SIAPFPNSKK IYVQGSRPDI RVPMREINLS ITPTEFGGEQ 
NPPVRVYDTS GVYTDPNVKI DVRQGLPDVR SAWIAERGDT EVLQQKSSSF TQQRLHDASL
DTLRFNHQRQ PLKAKPRANV TQMHYARCGI ITPEMEYIAI RENMSWQQAK EQGVLDQQHA
GEHFGANIPD EITPEFVRSE VACGRAIIPA NINHPELEPM IIGRNFLVKI NGNIGNSAVT
SSIEEEVAKL TWGTRWGADT IMDLSTGKNI HETREWIIRN SSVPIGTVPI YQALEKVDGV
AEDLTWEIFR DTLIEQAEQG VDYFTIHAGV LLRYVPLTAK RVTGIVSRGG SIMAKWCLAH
HRENFLYTHF EDICEIMKAY DVSFSLGDGL RPGSIADAND EAQFGELETL GELTKIAWKH
DVQVMIEGPG HVPMHMIKEN MDKQLRECGE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG
WYGCAMLCYV TPKEHLGLPN KDDVKEGIIT YKIAAHAADL AKGHPGAQLR DNALSKARFE
FRWEDQFNLG LDPDTARSYH DETLPKDSAK VAHFCSMCGP KFCSMKITQE VRDYAAEHGT
DITPIAEDEV VRMIDVEAEM RKKSEEFREK GSEIYGKI