Gene Anae109_3441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3441 
Symbol 
ID5376575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4041773 
End bp4043554 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content71% 
IMG OID640844966 
Productpeptidase U34 dipeptidase 
Protein accessionYP_001380609 
Protein GI153006284 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.534665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.658242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGTC GCTCGCTCGC CGTCCCGCTC GCCGCCCTCT TCGCGGCCAG CGCGCTCGCC 
GCCCGCCCCG CGCGCGCCTG CACGAACATC CTGGTCTCCC GCGGCGCCAC CGCGGACGGC
TCCACGCTCG TCACCTACGC CGCCGACTCG CACGACCTGT ACGGCGAGCT GTACTACACC
GCCGCCGCCC GCCACCCGGC CGGCGCCGCG CGCGAGGTCG TCGAGTGGGA CACGGGCGAC
CGCCTCGGCC AGATCCGGCA GGCGGAGGTC ACCTACAACG TCGTCGGCAA CATGAACGAG
CACCAGGTCG CGATCGGCGA GACCACCTTC GGCGGCCGCG ACGAGCTCGT CGACCCGAAG
GGCGGCATCG ACTACGGCAG CCTCATGTAC GTCGCGCTCG AGCGGGCGCG CACGGCGCGC
GAGGCGATCG AGGTCATGAC CTCGCTCGTC GCCGAGTACG GCTACCGCAG CTCGGGCGAG
TCCTTCTCGA TCTCGGATCC GAAGGAGGTC TGGATCCTCG AGATGATCGG CAAGGGGCCG
GGGCGGAAGG GCGCGGTGTG GGTCGCCCGG CGCGTGCCCG ACGGCCACGT CTCGGCCCAC
GCGAACCACG CGCGCATCCG CCAGTTCCCG CGCAACGATC CGAAGAACAC CCTCTACGCG
AAGGACGTCG TCTCGTTCGC CCGCGAGAAG GGCTGGTTCA CGGGCAAGGA CGAGGACTTC
AGCTTCTCCG ACACCTACGC GCCCCTCACC GCCGGCGCCC TGCGCGCGTG CGAGGCGCGG
GTGTGGAGCG TGTTCCGCCG GGTCGCCCCG TCGCTGAACC TCTCCGTCGA GCACGTGACG
GGCGGCCCGA GCGCGCCGCG CCTGCCGCTG TGGGTGAAGC CCGACGCCAA GGTGTCGGTC
CGTGGCGCGA TGGAGCTCAT GCGCGATCAC TTCGAGGGGA CCCCGCTCGA TCTCTCGCAG
GGCGTGGGCG CCGGCCCGTT CGCGCTGCCC TACCGGTGGC GGCCGATGAC CTGGAAGGTG
GACGGGGCCG AGTACCTCCA CGAGCGCGCC ATCTCCACGC AGCAGACCGG CTTCTCGTTC
GTGGCGCAGG CGCGCGAGTG GCTCCCGGGG CCGATCGGCG GCGTCCTCTG GTTCGGGCTC
GACGACACCT ACTCCACCGT CTACGTCCCC CAGTACTGCG GCAACCGCGC GGTCCCGCGG
ACGTTCGGCG TGGGCAGCGG CAACTTCCAG GAGTTCAGCT GGGACTCGGC GTTCTGGGTC
TTCAACTTCG TCTCGAACTG GGCGTACGGG CGCTACAGCG ACATGATCCA GGACGTCCAG
AAGGTGCAGC GGGAGCTCGA GAGCGGGTTC CTCTCGCGGC AGGAGGACGT GGAGAAGGCC
GCCCTCGCCC TCCACAAGAC GTCCCCGGGG CTCGCGCGCG ACTACCTCAC CCAGTACTCG
GTCGAGCAGG GCGACCGCAC CACCGCGCGC TGGCGCAGGC TGGGAGAGAC GCTCATCGTG
AAGTACCTGG ACGGGAACGT GCGCGACGAG CACGGCAAGG TGAAGCATCC CGATTACCCG
GAGGGCTGGC GGCGGCGGAT CGCGCAGGAC CGGGGGGAGC AGATCAAGGT GGTGAGGTTC
CCGGGGGAGA AGGAGGAAGA GGAGCCGACC TCGACCCCGA CCTCGACGGC GCCGGATCGG
ACGGCCGCTC ATCCCGAGCG CAGCGACCCC GCGGCCGCGG CGGCGCGGAG TCGAGGGACG
ACCCCGACCC CGAGCGGATC TCCTGAGGCC GCTCGCCCGT GA
 
Protein sequence
MIRRSLAVPL AALFAASALA ARPARACTNI LVSRGATADG STLVTYAADS HDLYGELYYT 
AAARHPAGAA REVVEWDTGD RLGQIRQAEV TYNVVGNMNE HQVAIGETTF GGRDELVDPK
GGIDYGSLMY VALERARTAR EAIEVMTSLV AEYGYRSSGE SFSISDPKEV WILEMIGKGP
GRKGAVWVAR RVPDGHVSAH ANHARIRQFP RNDPKNTLYA KDVVSFAREK GWFTGKDEDF
SFSDTYAPLT AGALRACEAR VWSVFRRVAP SLNLSVEHVT GGPSAPRLPL WVKPDAKVSV
RGAMELMRDH FEGTPLDLSQ GVGAGPFALP YRWRPMTWKV DGAEYLHERA ISTQQTGFSF
VAQAREWLPG PIGGVLWFGL DDTYSTVYVP QYCGNRAVPR TFGVGSGNFQ EFSWDSAFWV
FNFVSNWAYG RYSDMIQDVQ KVQRELESGF LSRQEDVEKA ALALHKTSPG LARDYLTQYS
VEQGDRTTAR WRRLGETLIV KYLDGNVRDE HGKVKHPDYP EGWRRRIAQD RGEQIKVVRF
PGEKEEEEPT STPTSTAPDR TAAHPERSDP AAAAARSRGT TPTPSGSPEA ARP