Gene Sde_1033 details

Gene Information

Locus tag: Sde_1033
Symbol:
ID: 3967787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 1321308
End bp: 1323032
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 47%
IMG OID: 637920100
Product: putative oxidoreductase chain
Protein accession: YP_526507
Protein GI: 90020680
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
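
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based, inclusive coordinates and a protein length that excludes the stop codon; both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1321308, 1323032)
print(length)                     # 1725
print(protein_length_aa(length))  # 574
```

Both results match the Gene Length and Protein Length fields listed for Sde_1033.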


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0731446
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGATC CTTTTTTAAT TAAAAAAACC ACCAAAAAAT TTGACGCTAT TGTTGTCGGC 
TCTGGCATTT CTGGTGGATG GGCTGCCAAA GAACTGTGTG AGCGCGGCTT AAAAGTTTTG
CTGGTAGAGC GCGGCCGCGC AGTGCGCCAC CAGAAAGATT ACGAAGGCGA ATTTAAAGCA
CCGTGGGACC TGCCACACAG GAACCAAGTG GATTACAAGT TAGTACAAGA CCAGCATCAC
GTGCAACGCA AGTGCTACGC CTTTAATGAT GCGACCAAAC ACTTTTTTGG CAACGACCGC
GACTACCCCT ACATAACCAA AGAGGGCACA GGCTTTGATT GGATACGAGG CAATCAACTC
GGTGGCCGCT CATTGCTATG GCACCGTCAA TCCTTCCGCT GGGGAGAAAT AGACTTTGAA
GAAAACCTGC GCGACGGCAA CGGTTGCGAC TGGCCAATTC GCTATAAAGA CTTAGAACCG
TGGTACTCCC ATGTAGAAAA ATTCGCGGGT ATTTCAGGTA GCGTAGAAAA CATACCCACC
TTGCCCGATA GCGAATTTTT GCCACCGTTT GAGATGAACG CTGCCGAAAA ATATGCAAAA
AAAGCTATCG AAAAAAAATA CCCTCATATG AAATTCATAC AGGGTCGCTG TGCGCACTTA
TCTGAGCCCA CCCAATTTTT TCTAGATCAA GGCCGCGGTA AATGTATGGC CCGCAATCAG
TGCCAGCGCG GCTGCTCTTT TGGTGCATAC TTTTCTACCC TTAGCTCCAC GTTGCCTGCG
GCGATCAAAA CAAACAACTT AAGCATTGCC TGCAACAGTG TTGTACACAG TGTTATTTAC
GATGAGAAAA GCGGCAAAGC AACCGGTGTA AATGTAATAG ACGAAGAAAC ATTAGAAACT
CGCGAATATT ACGGCGGCAT GATTTTTCTG TGCGCCTCAA CACTTGGCAC CACCCAAATA
ATGCTTAATT CTAAATCCAA AGCTTTTCCA AATGGTATTG CCAACTCTTC GGGCGTGCTC
GGTCACTACT TGATGGATCA CATTTATAAT TCTAGCGCTT ACGGTGTGTT AGAAGGCTTT
GAAGATGACT ACTACAAAGG GCAGCGCCCA ACAGGCCCTA TTATTCCTCG GTTTAAAAAC
TTAAAGAAAA ATAGCGAGAA GTTTAATCGC GGTTATTTTT TACGCGGCGG CGCGGTTCGG
CCAGTTACCT ATAGCTCCCC CGATGACAAA ACCTTTGGCG TGGAATTAAA AAACAAGCTA
CAAAAACCAG GGCCTTGGAT TCTTCATTTC GGCGGCTCAG GCGAAATGAC ACCCAAGTAC
GAAAACATGG TTAGCTTACA CCCAACCAAA ACCGACAAGT GGGGCATTCC ACTATTAGTT
TTCGATTGTA AATTCACCGA AAACGACAAG CTGATGATGG AAGACATGGC CGATACTGCT
GCAGAAATTC TCACAACAAT TGGCTGCAAG CATGTCACTA AAGATATTAG CGATGCGCCC
CCCGGCTTAG CCATTCACGA AATGGGCACC GCACGCATGG GCCGCGACCC CAAAACCTCG
GTATTAAATG GCAACAACCA ATGCCACGAT GTACCCAACC TATTTGTAAC AGACGGTGCA
TGTATGGCCT CTACTGCTCA CCAAAACCCA TCACTTACCT ATATGGCAAT TACCGCGCGC
GCAGCTGCCT TTGCTGCAGA GCAATACAAA AATAAAGCGC TCTAA
 
Protein sequence
MIDPFLIKKT TKKFDAIVVG SGISGGWAAK ELCERGLKVL LVERGRAVRH QKDYEGEFKA 
PWDLPHRNQV DYKLVQDQHH VQRKCYAFND ATKHFFGNDR DYPYITKEGT GFDWIRGNQL
GGRSLLWHRQ SFRWGEIDFE ENLRDGNGCD WPIRYKDLEP WYSHVEKFAG ISGSVENIPT
LPDSEFLPPF EMNAAEKYAK KAIEKKYPHM KFIQGRCAHL SEPTQFFLDQ GRGKCMARNQ
CQRGCSFGAY FSTLSSTLPA AIKTNNLSIA CNSVVHSVIY DEKSGKATGV NVIDEETLET
REYYGGMIFL CASTLGTTQI MLNSKSKAFP NGIANSSGVL GHYLMDHIYN SSAYGVLEGF
EDDYYKGQRP TGPIIPRFKN LKKNSEKFNR GYFLRGGAVR PVTYSSPDDK TFGVELKNKL
QKPGPWILHF GGSGEMTPKY ENMVSLHPTK TDKWGIPLLV FDCKFTENDK LMMEDMADTA
AEILTTIGCK HVTKDISDAP PGLAIHEMGT ARMGRDPKTS VLNGNNQCHD VPNLFVTDGA
CMASTAHQNP SLTYMAITAR AAAFAAEQYK NKAL
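
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below joins the blocks and translates codons using the standard codon assignments, which NCBI translation table 11 shares with table 1. The helper names are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. The assignments are the same
# in NCBI translation tables 1 and 11 (they differ only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on characters other than T, C, A, G.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * a + 4 * b + c]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGATTGATC CTTTTTTAAT TAAAAAAACC ACCAAAAAAT TTGACGCTAT TGTTGTCGGC"
dna = join_blocks(first_line)
print(len(dna))        # 60
print(translate(dna))  # MIDPFLIKKTTKKFDAIVVG
```

The translated fragment matches the first twenty residues of the protein sequence listed above. This sketch does not handle alternative start codons, which table 11 permits; that is not an issue here because the gene begins with ATG.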