Gene Sde_1073 details

Gene Information

Locus tag: Sde_1073
Symbol:
ID: 3968237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 1373072
End bp: 1374787
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 46%
IMG OID: 637920141
Product: DNA polymerase III, alpha subunit
Protein accession: YP_526547
Protein GI: 90020720
COG category: [R] General function prediction only
COG ID: [COG3176] Putative hemolysin
TIGRFAM ID:
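
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 1373072
END_BP = 1374787


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1716 571
```

Under these assumptions the computed values agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields.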


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.475912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAAACA TCGAACAAGC TGTTACTAAC AAGTTTCCTA AGTTTGCTAG CCAACCGGCT 
ATTATCCGCA AACCTACACT CTCCCTCCTG CGCCGACTTA CCCACGAAAC TGAAATTAAT
GCCTTTTTGC GCGACAACCA AGATGCCATT GGCTTTGAGT TTATCGACCG CGTACTGGAA
TATTTCGACT TTAGCTATCG CGTAAGCGCT AGAGATAAAA GCAATATTCC CGCTGCTGGT
CGCGTAGTTA TTTTCGCCAA CCACCCCATT GGCTCTCTCG ATGGCTTAGC TATTTTGCGG
CTTATTGGCG AAGTGCGCCA AGATGTAAAA ATTATCGCTA ACGATATGCT CAGCCATTTC
TCTGCCCTAG ATAACCTGAT TATTCCACTG GATAACATGA CCGGCAGCAG TGCCCGCCGC
AGCTACAAGC GCGTAATGCA GGCACTAGAA AAAGAGCAGG CGATTATTGT ATTCCCTGCG
GGTGAGGTAT CCCGCGCCAG TGCTAATGGT GTGCGCGATT CTCGCTGGCT GCCCGGCTTT
TTACATTTTG CTCGCCGCGG CAAGGCGCCG TTGCTGCCGG TGCATATCAA AGCAAAAAAC
TCATTACTCT TTTACGGCGC CAGCATGCTG TTTAAGCCAC TTGGCACTGC ACTACTGGCA
AGAGAAATGT TTAACAAGCA ATCGCGCACC ATTAACTTTC GTGTCGGCGG TATGATTCCG
CCAGCGGCTT TAGAATCAGA CCAGTTACAC GATCGCACAC TAGTAAAACG TCTCAAAAAG
CATTTATATA AAGTTGGCTC GCAAAAACGC CCTATATTTC AAAGTGAACG CACCATAGCC
CACCCAGAAG ATCGTCAGCG TTTACAAGAG GAATTGCGCG ATGCAAAGCT ACTGGGCGAG
ACCCGAGACA ACAACCGCAT CTACTTAGTA AGTTACAAAC AAGATTCCGC AGTTATTCGC
GAGATTGGCC GCTTGCGCGA ACTCGCCTTT AGAAAAGTAG GTGAAGGTAC AGGTAAAAAG
CGCGATTTAG ATGCGTTCGA TAGCCACTAT AAGCATTTAG TATTGTGGGA TAGAGAAAAC
CTAAAAATTG CAGGCTCTTA TCGCCTAGGC GAGGGCAAAC ACATCTACGA TACACTTGGT
GAAAGCGGCT TTTACACCAG CACGCTATAT GACTTTAAAC CAGAATTTAA AAAATATTTA
GAGCAAGGTG TAGAGTTAGG CCGCAGCTTT GTAAACCCTG AATACTGGGG TAAAGCGAGC
CTTGATTATT TGTGGCAAGG GCTGGGAACA TTTTTAGCTA ACAACCCCGA AGTACGCTAT
TTAATTGGGC CAGTAAGTAT GAGCGCAGAT TACCCGCGCG AGCTTATGGA CCAACTGGTT
TATTTTTACC GCCGTTACTA TGCATGCCCC GAAAATTTAG CGGTAGCGAA TCACCCCTAT
ACTTTAACGG CCGAGCAAAA CGCAAAGTTT GAAGCTCTGT TTGAAAATAA AGAGCGCGAT
GAAGCCTTCG ATTTTATGCA GGCAAATTTT ATGGCAGTCG GCCACAAACT TCCTATGCTA
TTTAAACAAT ACACAGCACT CTTTGAAGAA GGTGGTTTTC AGTCGGTAGT ATTTAGCGTA
GACCCAGACT TTGGCGACTG CCTAGACGGT TTATGCATGT GCGACATTAC TAAGCTTAAA
GCCAGTAAAT ATAAGCGTTA TATTGGGGGG AAATAA
 
Protein sequence
MINIEQAVTN KFPKFASQPA IIRKPTLSLL RRLTHETEIN AFLRDNQDAI GFEFIDRVLE 
YFDFSYRVSA RDKSNIPAAG RVVIFANHPI GSLDGLAILR LIGEVRQDVK IIANDMLSHF
SALDNLIIPL DNMTGSSARR SYKRVMQALE KEQAIIVFPA GEVSRASANG VRDSRWLPGF
LHFARRGKAP LLPVHIKAKN SLLFYGASML FKPLGTALLA REMFNKQSRT INFRVGGMIP
PAALESDQLH DRTLVKRLKK HLYKVGSQKR PIFQSERTIA HPEDRQRLQE ELRDAKLLGE
TRDNNRIYLV SYKQDSAVIR EIGRLRELAF RKVGEGTGKK RDLDAFDSHY KHLVLWDREN
LKIAGSYRLG EGKHIYDTLG ESGFYTSTLY DFKPEFKKYL EQGVELGRSF VNPEYWGKAS
LDYLWQGLGT FLANNPEVRY LIGPVSMSAD YPRELMDQLV YFYRRYYACP ENLAVANHPY
TLTAEQNAKF EALFENKERD EAFDFMQANF MAVGHKLPML FKQYTALFEE GGFQSVVFSV
DPDFGDCLDG LCMCDITKLK ASKYKRYIGG K