Gene Dfer_3947 details

Gene Information

Locus tag: Dfer_3947
Symbol:
ID: 8227542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 4793507
End bp: 4795507
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 54%
IMG OID: 644931788
Product: aminopeptidase precursor
Protein accession: YP_003088316
Protein GI: 255037695
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
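
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the values in this record (4795507 - 4793507 + 1 = 2001 bp, and 2001 / 3 - 1 = 666 aa). The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4793507, 4795507)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

The same check can be applied to any unspliced CDS record; a spliced gene would need its exon coordinates instead of a single start and end.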


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTT CAGTATTTCG TCTGGGCGTC ATTGCAATGA GCTGCCTGGG CTTCTTCAAC 
AACACTATTG CACAGGAGGC GGAGAAGGTG AAATATGATC ACCATGTCCT TTTTCATCCC
CTGTTTAATT TCCAGCCGGG CAACGAATAC CGGACGGGAA GCGGTGCTCC GGGGCCAAAG
TACTGGCAAA ACCGGGCCGA TTACAAGATT AACGTTACGC TCGAAGAGGA AAAAGGCACT
GTGAGCGGGG AGGTGGAGCT TACCTATAAA AACAATAGTC CCGAAAACCT GGAATTTATC
TGGCTGCAAC TCGACCAGAA TGCATTCGGA AGCCATTCGA GAGGTGCGCA GACGACGCCG
GTAACGGGCG GCCGTTTTGG CAACGCAGGT TTCGATGGAG GCGATTCCAT TAAATCGGTA
AGCGTGCAGC AAGGCAAAGG CTCGTTCGTG GATGCGGAAT ACAAAATCAC CGACACCCGC
ATGCAGATCC GCCTCAAAAC ACCCATGAAA GGTAATGGCG ATGTGATCAA AATCAAGCTG
GCCTATTCAT TCAAAATCCC CGAATACGGC TCCGACCGCA TGGGAACGCT CGAAACCAAG
AATGGTATCG TGTACGAAAT GGCCCAATGG TATCCGCGCG TGGCTGTTTT CGACGACATC
GAAGGCTGGA ACCTGCTGCC CTACCTCGGC GCCGGCGAGT TTTACCTCGA ATATGGCGAT
TTTGAATACA ATGTAACTGT TCCCTGGGAC CATATCGTGG TAGGTTCGGG CGAATTGCTC
AATCCGAATG AAGTACTCAC CGCCGAGCAG CGCAAAAGAC TGGCTGACGC CGCAAAAAGC
GACCAGACCG TCGTGATACG CAGCGCGGAG GAGGTGACCA ACCCCAACAC GCGGCCTAAG
CAATCGGGCA CGCTCACATG GCGCTTCCGC TGCTTGCAGG CGCGCGACAT TGCCTGGGCA
AGTTCCAAAG CATTTGTATG GGATGCGGCG CGGATGAACC TGCCGAAAGG CAAAACGGCA
CTGGCGCAGT CGGTGTACCC TGCCGAGGAT GGCGGGCTGG AAGGCTGGGG CCGCTCTACG
GAATATGTGA AGGGCTGTAT CGAGTTTTAT TCCAACTACA TTCATGAATA CACCTACCCC
GTGGCGACCA ACGTGGCCGG CATCGTGGGT GGTATGGAAT ACCCGGGCAT TGTGTTCTGC
AGCAGCAAAA GCCGCAAGGA CGATTTGTGG GGCGTAACCG ACCACGAGTT TGGTCACAAC
TGGTTCCCGA TGATCGTTGG TAACAACGAG CGCAAATATG GATGGATGGA TGAGGGTTTC
AACACGTTCA TCAACTTCCT TTCGAGCGAT AACTTCAATA ATGGCGAATA CAAAACGACC
CAGATGAACG ATATGCACCG TCTGGCGCCG ATTATCTTCC GCCCGAAAGC CGACCCGATC
ATGACGATCC CCGACGTGGT GCAGGCCGTA AACCTGGGCT GGGAAGCGTA TTACAAGCCT
GCTTTGGGCC TGAAAATGCT TCGGGAGCAG GTTTTAGGCA AAGAGCGCTT CGACTATGCG
TTCAAAATTT ACGTGCAGCG CTGGGCATTC AAGCATCCTA CGCCTTACGA CTTCTTCCGT
ACGATGGAGG ACGCCGCCGG CGAAGACCTG GGCTGGTTCT GGAAAGGCTG GTTCTTCGAG
AACTACAAGC TCGACCAGGC GGTGAAGCAG GTTGCCTACG TGGAGCAAAA CCCGCAAAAA
GGATCTTACA TTACGATCGA GAACCTGGAC CAGCTGGCGA TGCCTGTGAA AGTGGATATC
GAGGAAGTGA GCGGCAAAAA GACGCGCGTA GAACTGCCGG TGGAAGTATG GCAGCGCGGC
GGTACGTGGA CTTTTAAAGC GGCTACTCAG CAGCCAATCC GCTCGGTAAC GATCGACCCG
GACCGCAACC TTCCGGACAT TAATCCTGAA AACAATGTGT GGAAACCGGC CTCATACAAC
TCCGAGCCCG ATGTGAACTA G
 
Protein sequence
MKISVFRLGV IAMSCLGFFN NTIAQEAEKV KYDHHVLFHP LFNFQPGNEY RTGSGAPGPK 
YWQNRADYKI NVTLEEEKGT VSGEVELTYK NNSPENLEFI WLQLDQNAFG SHSRGAQTTP
VTGGRFGNAG FDGGDSIKSV SVQQGKGSFV DAEYKITDTR MQIRLKTPMK GNGDVIKIKL
AYSFKIPEYG SDRMGTLETK NGIVYEMAQW YPRVAVFDDI EGWNLLPYLG AGEFYLEYGD
FEYNVTVPWD HIVVGSGELL NPNEVLTAEQ RKRLADAAKS DQTVVIRSAE EVTNPNTRPK
QSGTLTWRFR CLQARDIAWA SSKAFVWDAA RMNLPKGKTA LAQSVYPAED GGLEGWGRST
EYVKGCIEFY SNYIHEYTYP VATNVAGIVG GMEYPGIVFC SSKSRKDDLW GVTDHEFGHN
WFPMIVGNNE RKYGWMDEGF NTFINFLSSD NFNNGEYKTT QMNDMHRLAP IIFRPKADPI
MTIPDVVQAV NLGWEAYYKP ALGLKMLREQ VLGKERFDYA FKIYVQRWAF KHPTPYDFFR
TMEDAAGEDL GWFWKGWFFE NYKLDQAVKQ VAYVEQNPQK GSYITIENLD QLAMPVKVDI
EEVSGKKTRV ELPVEVWQRG GTWTFKAATQ QPIRSVTIDP DRNLPDINPE NNVWKPASYN
SEPDVN