Gene Sde_0157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0157 
Symbol 
ID3964791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp194498 
End bp195706 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content43% 
IMG OID637919216 
Productglycosyltransferase-like protein 
Protein accessionYP_525633 
Protein GI90019806 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTA CCCCCTTTCC CGCCAGAAAG AACGGTTTAA CCCTACGCTA CTATGCATTG 
GCAAAATCAC TTAGCGAAAC CGACAACGAG GTATCGCTAC GTATATTTCC ACACATAGAA
AAAACGTGGA ACCAAGACGA TATAGATGCG TTACAAGCAG CGGGTATTAA AACAACCGCG
GCCGATAGCT ACCCCATTAA AACCCCATCT ATCACCCAAA AAATACTCGC ACGAGTTAGT
GCACTTTTAC CTTTTGGCAA ACCGCTTTCA CAAATTTCAT ACAATTATAA AACCGCTTGC
GATTACTTTA ATACAAGCGA AAACTTTTCA GACTTCGATG TAGTACTCGC CGTAACAAGT
TCTATTTTTG AGCTGTACGA TCAACTACCA AAAGCAAAAA AGGCAAAAAT GGTCGTTTGG
GATGAAGTAG ACTCACTACC GCTACATTTT TACCGACGCT ACCAAGCCGG AGATAGCGTT
TTGCTGTCCA AGAAGTATGA ATTTATTAAA TACAAAGCTT GGGAGCGAGA TCTAATAAAC
AGAGCAGATA AAGCCTGTTA TATCTCTAAA GTGGATGCGA GCTTTTCTAA AGGGGATGCA
GACAAAATAA TTGTTCTACC CAATGGCATT GAGCACAGCG AATTACACGA TGCCCCCCTT
GTAGACCTGC ACAGCGACAG CATAGGGTTT GTTGGCGATA TGGCCTACAG GCCAAACGTT
AAAGCCGTAA AATGGTTTTT AGACAACGTG TGGCCAGACT ACGTAGAAAA AAATCCTAGC
TGCTACTTTT ATATTGTTGG CCGCAGCCCT GAACAAAGCG TTTACGATGA AGCAGCCAAA
CATAAAAATG TTGTTGTTAC CGGCTCGGTA GATAACATTT GGTCCTACTA TAAATCTATT
AAACTATTTG TGTGCCCGCT GTTTAGCGGA GCGGGTTTAC AAAACAAAGT TATTGAAGCC
ATGTTTGCTT CTAAGCCAGT TATTTCTACA ACAATAGCGA ATGCCGGTGT GAATGCGGTT
AATGGCGAAT CGATAGTACT AGCTGATACG GCCGAACAGT TTACGCAAGC ATTAAACACA
CTTAGACACG ACGCGAGCGA AGCGCACCGT ATTGGTGATG AAGGCAGAAG GTACGCAACT
AAGAATTTTG ATTGGTCAAC GCTCACCGAA CAACTGCTAA CAACATTCAG GCGTCATTTG
GGATACTAA
 
Protein sequence
MGVTPFPARK NGLTLRYYAL AKSLSETDNE VSLRIFPHIE KTWNQDDIDA LQAAGIKTTA 
ADSYPIKTPS ITQKILARVS ALLPFGKPLS QISYNYKTAC DYFNTSENFS DFDVVLAVTS
SIFELYDQLP KAKKAKMVVW DEVDSLPLHF YRRYQAGDSV LLSKKYEFIK YKAWERDLIN
RADKACYISK VDASFSKGDA DKIIVLPNGI EHSELHDAPL VDLHSDSIGF VGDMAYRPNV
KAVKWFLDNV WPDYVEKNPS CYFYIVGRSP EQSVYDEAAK HKNVVVTGSV DNIWSYYKSI
KLFVCPLFSG AGLQNKVIEA MFASKPVIST TIANAGVNAV NGESIVLADT AEQFTQALNT
LRHDASEAHR IGDEGRRYAT KNFDWSTLTE QLLTTFRRHL GY