Gene Sde_0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0503 
Symbol 
ID3965617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp615993 
End bp617444 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content48% 
IMG OID637919566 
Productthiamine biosynthesis protein ThiI 
Protein accessionYP_525979 
Protein GI90020152 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000447545 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTTG TTGTTCGTTT ATTCCCTGAA ATATCCATCA AAAGTGCCCC CGTGCGCAAG 
CGCTGGACAA AAATGCTAAC CGATAACTTA CGGTTGATGG CTAAGCGTAT CCATCCAACC
GCTAGGGTGA ACAAAGAGTG GGATCGTATC GAGGTGACGG CCAAGGTGGA TGACCCCGTT
GTGGAGGGGC AGTTAATCGA TATGTTGGCG CGTACACCGG GAATTGCGAA CTTTTCCCAT
GTGCAAACGC ACCCTTTTGA ATCATTGCAT GATATTTACG AGTTGGTACA AGCATCATGG
GGTGACCAAC TTAAGGGCAA AACCTTTTGT GTGCGAGTAA AGCGCACTGG CAACCACGAT
TTTACTTCTA CCGAAGTCGA GCGCTATGTG GGTGGTGGCC TTAATCAAAA TAACCCCACT
GGCGGCGTAA AACTAAAAGA CCCTGATGTG TCTATTGGTT TGGAAGTAAA AGATGATCAG
GTGTATTTGG TAACCAAAAA ATATCAGGGC TTAGGCGGCT TCCCCATGGG GACGCAGGAG
TCTGTGCTAT CTCTTATCTC TGGCGGCTAC GATTCTACCG TTGCCAGCTT TCAAATGATC
AAGCGCGGTT TGCGCACCCA TTACTGCTTT TTTAATTTAG GGGGGCGCGA GCACGAGCTA
GCAGTTAAAG AAATCGCGTT CTTTTTATGG AATCGATTCG GTTCTACTCA CCGCGTGCGT
TTTATCTCGG TGCCGTTTGA AGGTGTTGTA GGGGAAATTC TGCAAAAGGT TGGCCCTTCA
AATATGGGCG TAGTACTTAA GCGTATGATG TTGCGTGCAG GTGAGCGCAT TGCCGAGCGC
GGCGGTATTG AGGCAATGGT GACCGGCGAG GCGGTGGCGC AGGTATCGAG CCAAACCATT
CCCAACTTAT CTGTGATTGA TAGCGTTACC GATATGATGG TATTGCGACC GCTTATTGTG
ATGGATAAAC GCGATATTAT TGATATCTCC CGCAAAATTG GCGCAGAAGA GTTCTCTGCT
GCTGTGCCTG AGTATTGCGG TGTTATTTCG GTAAAACCCT CTGCAAAAGT GAATCGTGCC
AAGCTTGAGG CGGAAGAAGA AAAGTTTGAT TTTTCTATCC TCGAACACGC GCTAGAAAAC
GCGGTAGTGC AATCTATCGA TGAGGTGATG GATGATGCGC AAGAGCTTGC CGAGGTTGAA
TTGGTGTCGG AATTGCCCGT CAGTGCCAAA GTTATTGATA TTCGCCATCA CACCGAGCAA
GAACTGCGAC CCCTTACTGT TGAAGGGCGC GAAGTGTTGG AAATACCTTT TTACCAATTA
AGCACAGCTT ATGCAGAGTT GGATAAAGCC GTTAACTATT ATTTATTTTG CGATAAGGGG
GTAATGAGCG GGCTGCATGC TCGGCATTTG CTAGATGCTG GCTATACTAA TGTAGGTGTT
TATCGGCCTT AG
 
Protein sequence
MRFVVRLFPE ISIKSAPVRK RWTKMLTDNL RLMAKRIHPT ARVNKEWDRI EVTAKVDDPV 
VEGQLIDMLA RTPGIANFSH VQTHPFESLH DIYELVQASW GDQLKGKTFC VRVKRTGNHD
FTSTEVERYV GGGLNQNNPT GGVKLKDPDV SIGLEVKDDQ VYLVTKKYQG LGGFPMGTQE
SVLSLISGGY DSTVASFQMI KRGLRTHYCF FNLGGREHEL AVKEIAFFLW NRFGSTHRVR
FISVPFEGVV GEILQKVGPS NMGVVLKRMM LRAGERIAER GGIEAMVTGE AVAQVSSQTI
PNLSVIDSVT DMMVLRPLIV MDKRDIIDIS RKIGAEEFSA AVPEYCGVIS VKPSAKVNRA
KLEAEEEKFD FSILEHALEN AVVQSIDEVM DDAQELAEVE LVSELPVSAK VIDIRHHTEQ
ELRPLTVEGR EVLEIPFYQL STAYAELDKA VNYYLFCDKG VMSGLHARHL LDAGYTNVGV
YRP