Gene Sde_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_0023 
Symbol 
ID3968156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp26674 
End bp27873 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content50% 
IMG OID637919082 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_525499 
Protein GI90019672 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAACCT TAATAAAAGG TATTCTCAAC GGCTGTAAAC CAATCACGGA TGAAAATATG 
CTTAGTTCAA TTGAAGAGCT CATACTGTTT ACTCGCCTAC CAGGTGTAGG TGCGGCCACA
TACTGGCAGT TGTTAGATCG CTTTCCTAGC ATACACTCTG CGCTGCAAGC CTCACCCGAA
GCACTTAAAC CATTTCTCTC GCAAGAAGCC CTCGATACCC TAGCGTTGGT ACGCAGCCAA
AAATCTGCCT CTATGGCTGT GCAGCAGGTG CAGCGGGATA TGGATTGGCT GCAAAAAAAC
GACATTACCC TAGTAGATAC TGACCACACC GCCTACCCAG AGTTACTGCG CGAAATAAAA
CGTACGCCGC CATTGCTGTA CGTCAAAGGT TGCCCGGCGA GTTTAAACTT TCCTCAGGTG
GCCATTGTGG GCAGCCGCAA GCCCACGCCT GCTGGCCGCG ACACTGCTCA GGCCTTTGGC
TCCGATTTGG CAAAATCGGG TTTTACCATT ACCAGTGGCT TGGCTATGGG TATTGATGCC
GCCGCGCACG AGGGCGCCGT TAAAGTTAAA GGCCGTACCA TTGCAGTAAT TGGTACCGGC
ATAGATAGCG TTTACCCCCA GCGCAATAGC GCATTAGCTA GCGAAATTAT TGCTAACGGT
GGTGCAATAG TAAGTGAGTT CCCCTTGGGT ACCGACCCAC AACCGCAAAA CTTTCCACAG
CGAAACCGTA TAGTTAGCGG TTTAAGTTTT GGTGTGGTGG TGGTCGAAGC GGCGGTAAAA
AGTGGCTCTC TTATCTCTGC GCGCTATGCA TTGCAGCAAA ACAGAGAGTT GTTTGCGGTG
CCTGGCTCCA TCCACAACCC TTTAAGTCGT GGTTGCCACG CATTAATAAA AGAAGGCGCC
AAGTTGGTAG AAACCTCGCA AGATATTGTC GATGAGCTAG GCGGCTTCTT ATCGCGCCAG
CGCGATTTAT TAGATATTTA CAAGCAGCCC GCAGAAAATA GTTTGCCAAA ACACGACGAG
CTTATAGCTA ACGATTTAGA AGACGATGTA CTGGCAAAAC TAGATTACAG CCCAACCCCC
ATCGACGCTT TAGCCGAGCG CACCAAAAAG CCCATTGGCG AAGTTATGTC TTGTTTGCTC
ACCATGGAGC TAAAAGGCTT AGTGGCCAAC TTGGGTGCAG GCTATATGCG GTTGCGCTAG
 
Protein sequence
MVTLIKGILN GCKPITDENM LSSIEELILF TRLPGVGAAT YWQLLDRFPS IHSALQASPE 
ALKPFLSQEA LDTLALVRSQ KSASMAVQQV QRDMDWLQKN DITLVDTDHT AYPELLREIK
RTPPLLYVKG CPASLNFPQV AIVGSRKPTP AGRDTAQAFG SDLAKSGFTI TSGLAMGIDA
AAHEGAVKVK GRTIAVIGTG IDSVYPQRNS ALASEIIANG GAIVSEFPLG TDPQPQNFPQ
RNRIVSGLSF GVVVVEAAVK SGSLISARYA LQQNRELFAV PGSIHNPLSR GCHALIKEGA
KLVETSQDIV DELGGFLSRQ RDLLDIYKQP AENSLPKHDE LIANDLEDDV LAKLDYSPTP
IDALAERTKK PIGEVMSCLL TMELKGLVAN LGAGYMRLR