Gene Sde_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0097 
Symbol 
ID3967336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp117170 
End bp118528 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content48% 
IMG OID637919156 
Producthypothetical protein 
Protein accessionYP_525573 
Protein GI90019746 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3064] Membrane protein involved in colicin uptake 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA CTTTTTTAGA AGAGCAGTTA AAACCTGCGG GCTTAAGCCG AGAAGAATTT 
TCTGAGGTGG TAATTCGCCT TTTGGATTAC GGCGTAATAT GCCGTGACGA AAGCCAAATA
GAAGCATCGC TGTACGATCG CTATTTGCAG TGCGCCAACA CCGTAGAAGA TTACTTAGCC
GTAATTGGTG TGCGCATACA GCACGATCGT CAATTTTGCT TTTTGCGCGT ATTTCCACCC
GGTGCCAATG TGCCCGGTAT GGCAGACGAA GACTGCTCGC CATTTAACAG TGGCTTTCGC
GCTAAACCCA ATCAGCAAGA GGTGGCAGCT ATTTTAGTGT TGCGCGCAGA ATACGAAAAA
TCGCTGCGCG AAGGGCAGGT AGATGAAAAA GGGCGGGCCA TGTTATCGCT AGAAGGCCTA
GCAATAGCTA TGAACAACCT GCTTAAACGC GCCTTGCCCG ATGGCCTCGT CGAGCGCAAA
AACTTGTTTC GCCGATTGCG TCAATTGCGC TTGGTGCACT TTAATACCGA AGATGAGCTG
GATAACAGCG AAAGCTGGTT AAGCATTCAG CCCTCTATTA CAAGCTTTGT AAGCGATGAG
GTTTTATCTA CCCTGTTAGA CCAAAGCGAT GCTTCTGTAC CGGTTAACAA GCCAGCTATA
AACGAAGCGG GCAAAAGCGA AGCCAACGCA ATTGAAGATG AAGCCGATAA AGAAATAGAA
GAAGAAAATA AAATTATTCC AAGCGCGCTT TTTGGCAGCA GCGATGCAGA TAGCAGCGCA
GGCGCTGATG AAGTGGAAGA AGGTGCTACC CAAAAAGAAC CAGAGCAAGT AGACGAGCAA
CAAGCGCCGC CAGTTGTGGA AGCAAGAGCT ACAAATGTAG AAGCTACAAA GGCAGAAGCA
GTTAATAAAA AAACTGCAGA AGAAAAAGCA GCTGAAGCAA CAACAAGCAA GCAAAAAGCC
CCAGCTAAAA AAACAGCAAC TAAACCCGCT GCCAAAAAAG CCGTAGCAGC TAAAACACCA
GCAAAACCTG CAGCCAAGGC GGCGCCCGCT AAAAAGGCTG CACCTGCTAA AAAAGCAATG
GTTGCCAAAC CCGCGGCTAA GCCTGCCAGC AAACAGCCAG CTACAACTAA AAAACCAGCA
GCTAAAAAGC CTACGGCAAA ACCAGCACCC GCAAAAAAAG CCACAACGGC TAAAGCTGCG
GTAAAAAAAG CGCCTGCTAA ACCGGCCGCA ACTAAAGCAA CAGCCACTAA AACCCCCGTC
GCTAAAAAAC CAGCAAAAAA AGCACCTGCA AAAACAGCAG CAGCTAAAAA ATCACCGGCT
CGCAAAGCCC CAGCAAAACC TAAAGGGGGT AAAGCCTAA
 
Protein sequence
MIDTFLEEQL KPAGLSREEF SEVVIRLLDY GVICRDESQI EASLYDRYLQ CANTVEDYLA 
VIGVRIQHDR QFCFLRVFPP GANVPGMADE DCSPFNSGFR AKPNQQEVAA ILVLRAEYEK
SLREGQVDEK GRAMLSLEGL AIAMNNLLKR ALPDGLVERK NLFRRLRQLR LVHFNTEDEL
DNSESWLSIQ PSITSFVSDE VLSTLLDQSD ASVPVNKPAI NEAGKSEANA IEDEADKEIE
EENKIIPSAL FGSSDADSSA GADEVEEGAT QKEPEQVDEQ QAPPVVEARA TNVEATKAEA
VNKKTAEEKA AEATTSKQKA PAKKTATKPA AKKAVAAKTP AKPAAKAAPA KKAAPAKKAM
VAKPAAKPAS KQPATTKKPA AKKPTAKPAP AKKATTAKAA VKKAPAKPAA TKATATKTPV
AKKPAKKAPA KTAAAKKSPA RKAPAKPKGG KA