Gene Dfer_3653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_3653 
Symbol 
ID8227238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp4451213 
End bp4452733 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content54% 
IMG OID644931485 
Productpeptidase M28 
Protein accessionYP_003088023 
Protein GI255037402 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0164431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAT CATTACTTCT GTTACCACTC GGTCTGGCCC TTACCCTTCC TGCCGAAGCC 
CAGAAAAAAA CACAGTCATT ACCTGAGTTT CAGGTCAAAA AATCGGAAAC CGAAGCGCAT
ATCCGCTTCC TTGCAGGCGA CGAGCTGATG GGCCGCCGCA CGGGCGAGCA GGGTAACTTT
GTAGCAGCGC GCTACATTGC CGAGCAATTC CGGAAAATGG GCGTGGTACC CGCGCCAGGC
AATACCGAAA CGGGCACATC GAGCTATTTC CAACGCGTTC CTTTCGAAAA AATGGGTGCT
AACGGCACCG GCGAGATCGC CGCGGATGCC GAGATCATGA AAAGCGGCAC CGACTGGATA
CTCATGGCCG GCGAAGCCGT AGAGCTCAAA GCGCCGATCA TTTATGCCAG CTACGGCCTT
GAAAACGCAG CCAAAAGCTG GGATGACTAC AAAGGGTTGG ATGTGAAAGG AAAAATCGTG
CTGGTGGAAA GCGGCACGCC CGAGAACCAG ACACCCTCGG AAATCTTCGC TACTTCTGCC
GAAAAACGTA CAATAGCTAT CGACAAAGGC GCTATCGCGG TGATAGAGCT TTTTAACGCA
CCTATTCCCT GGAATGTGGT GAGTAAGTTT TTTGCAGGAG AAAAAATATC GCTGGCCGAA
GGCACGGCTT CCCAATCCAT CCCGCATGCG TGGGTAAACG GCAAGGAGGC CAAATTCGCC
CGGGCATTGC GCGCGGTGAA AGAGGTGACG TTCAAAACCT CGGGCCGTGT TGCAAAACCC
ATTTACAGCT ATAATGTAGC CGGCTACATT CCAGGCACCG ATCCCAAACT GAAAGAGGAA
TATGTGCTCC TTTCCGCACA TTACGATCAC GTGGGCGTAG GCAAGCAGGG CGGGCAAACG
TACACGCCGG AGGACAGCAT TTTCAACGGC GCCCGCGACA ATGCATTCGG CGTTACCGCG
TTGCTCACCG CGGCCGAAGC ATTGGCCAAA AATCCGCCTA AACGCTCGAT TCTGCTCGTT
GCGCTGACGG GCGAGGAAGT GGGCTTGCTA GGCAGTAAAT ACTACGCGTC ACATCCGATC
ATGCCTCTGA ACAAATGCAT TTTCAATATG AATTCCGATG GTGCAGGCTA TAACGACACC
ACCATCGTAT CGGTAATGGG CCTCGACCGC ACCGGCGCGC GCGCGGAGCT CGAGGCGGCT
TGTAAGGCAT TCGGCCTGGG CATTTTCGCC GACCCATCAC CGGAACAGGG GCTTTTCGAC
CGTTCGGATA ATGTGAGTTT TGCCAGAGAA GGCATCCCCG CACCCACGTT CACACCCGGT
TTTAAAGAAT TTAACGGGGA TATTATGAAA AATTACCATC AGGTAACCGA CAACCCCGAG
ACAATCGACT TCAACTACCT GTTGAAATTC AGCCAGGCCT ACACCTACGC CACCAGGCTC
ATCGCCGACC GCAAAACAGC CCCGCAATGG AGCCCCGGCG ACAAGTACGA GCCCGCCGCG
AAGAAGCTGT ATGGAAAATA G
 
Protein sequence
MKLSLLLLPL GLALTLPAEA QKKTQSLPEF QVKKSETEAH IRFLAGDELM GRRTGEQGNF 
VAARYIAEQF RKMGVVPAPG NTETGTSSYF QRVPFEKMGA NGTGEIAADA EIMKSGTDWI
LMAGEAVELK APIIYASYGL ENAAKSWDDY KGLDVKGKIV LVESGTPENQ TPSEIFATSA
EKRTIAIDKG AIAVIELFNA PIPWNVVSKF FAGEKISLAE GTASQSIPHA WVNGKEAKFA
RALRAVKEVT FKTSGRVAKP IYSYNVAGYI PGTDPKLKEE YVLLSAHYDH VGVGKQGGQT
YTPEDSIFNG ARDNAFGVTA LLTAAEALAK NPPKRSILLV ALTGEEVGLL GSKYYASHPI
MPLNKCIFNM NSDGAGYNDT TIVSVMGLDR TGARAELEAA CKAFGLGIFA DPSPEQGLFD
RSDNVSFARE GIPAPTFTPG FKEFNGDIMK NYHQVTDNPE TIDFNYLLKF SQAYTYATRL
IADRKTAPQW SPGDKYEPAA KKLYGK