Gene Sde_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1422 
Symbol 
ID3966100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1840553 
End bp1841854 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content47% 
IMG OID637920499 
ProductThiS, thiamine-biosynthesis 
Protein accessionYP_526896 
Protein GI90021069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00197001 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAATACC CACTATTTGT AGTTTTAGGA CTCGCCGCCA CCCCACTTTA TGCAGGCACT 
GGCAGCTCAG CCAGCGGCAC TAACTTCACT TCCGGCCCTA GCGGCAACCA CACAGCCCTT
TTCTCTGCAG ACCACAACCC TGCAATGAAT AACTTTATGG TGCACGAAGA AGAAACCTAT
CGGTTTAACT TCGGCCCAGC ATTTAGCTAT GGCATAGAGC TTGGCGACGT AGGTAACTTC
GCAGATGATC TTGATGACCT TATCGATATT ATTGACGACC CAGACAGCAC CGACGAATCT
CCCTCAGATG TGTTAGAACG CTTTAACAGT GTTTTAGAAA ATATGGGGGA ACACGGTTAT
ATAAAAAATA CCATTAATGT TAGAGCCCCT GTGCTGCCAC TATTTTACAA ATCAGACCGG
TTTGGCGGCA CCTTCTCAGT CGATTTCAGT TTGGACGCCA CAATAGGTGC GCGCGTACTC
GATGACGAAT TGAAATACGA CCAAGATAAA GACTACCTCA CCAACACCGC TATTTATTTA
AAAAGTGGTA TCGAAAAGCG TCTGTCATTT GGTTACGGCC GCGAGTTATG GTCCTACAAC
GACATGGGAA AACTATACGG TGGTGCTCGC TTAGATATTT CCAACATGGA GCTAAGTAAA
CAAGTTATGC CGCTTCAAAT GCTAGACGGC AGAGATATAA GCGACGTAAT GACGGACGAA
TACGATAAGA ACTTAGTGTC TACCACTGCC ATGTCTGTCG ATATTGGTGT TGTGTGGGAT
GCCGACTTTT ATCGCCTTGG CCTAACCTTG GCAGACATCA ATTCACCTAG TTTCGATTAC
GGCGCAGTAG GTGTGAATTG TGACGAACGC GAACCAGGCT CCACAGAGCA ATCGGCATGT
TACGCAACCG AATATTTTAT TGAAGTAAAA GGGGACCTAA AAGCCTACGA AACTCATACT
AAGCATGCGC GCGCGACTGT TGATGGCGCA CTGCATATAG GCAATAAATG GTGGCTCACC
TCTGCACTAG ACTTGGCCGC ATACGACGAC CCCGTAGGCT TTGAAAACCA ATGGTTTCAC
CTAGCCGCCA GCTACGAAAC CGGCGGCTTC TGGCTGCCCT CTTTACGCTC TGGCTATCAA
GTAAATTTAA CCGGTAGCGA AACAAGCAGC CTAAATGTGG GCGCAACATT TTTCAAAATG
CTGAACTTTG ATATTGAGTA CGGTTTAGAA AGCGTGGAAG TGGACGGCTC CACAGGCCCA
AGACGGCTAG GATTTGCCCT AAGCTTAACC GAACATTTTT AA
 
Protein sequence
MKYPLFVVLG LAATPLYAGT GSSASGTNFT SGPSGNHTAL FSADHNPAMN NFMVHEEETY 
RFNFGPAFSY GIELGDVGNF ADDLDDLIDI IDDPDSTDES PSDVLERFNS VLENMGEHGY
IKNTINVRAP VLPLFYKSDR FGGTFSVDFS LDATIGARVL DDELKYDQDK DYLTNTAIYL
KSGIEKRLSF GYGRELWSYN DMGKLYGGAR LDISNMELSK QVMPLQMLDG RDISDVMTDE
YDKNLVSTTA MSVDIGVVWD ADFYRLGLTL ADINSPSFDY GAVGVNCDER EPGSTEQSAC
YATEYFIEVK GDLKAYETHT KHARATVDGA LHIGNKWWLT SALDLAAYDD PVGFENQWFH
LAASYETGGF WLPSLRSGYQ VNLTGSETSS LNVGATFFKM LNFDIEYGLE SVEVDGSTGP
RRLGFALSLT EHF