Gene Anae109_0118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0118 
Symbol 
ID5375224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp145474 
End bp147537 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content74% 
IMG OID640841630 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001377320 
Protein GI153002995 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.125913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.195999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCT CCGCCCTCGC CGCGCTCGCC GTGGCCGCGG CCGCGGCCGC GCCCCGGCCG 
TTCACGGTGG ACGACCTCGT GGCGCTCCCG CGGGTCGGCG CGCCGGAGCT CTCTCCGGAC
GGTACCCTCC TCGCCTTCAC CGTCGGGCGA CTCACCGGCG CCGGCGATCG GATCGTCTCC
ACGCTGTACG TGGCGCCCGC CGCGGGCGGA GCGAGCAGGC AGCTCACCGA TCGTGACGAG
CGCATCTCCT CGCCGCGCTT CTCCCCGGAC GGGAAGCGGC TCGCCTTCGT CTCGACCCGG
AGCGGCGCTC CCCAGGCGCA CGTCCTCGAC CTCGCCTCGG GCCAGATCCG CGTCGCGAGC
GCGCTGCCCG GCGGCGCGAA CGAGCTCTCG TGGACGCCGG ACGGGGCCGC CCTCCTCGTC
ACCGCGGACG TGGACCCGCG CTGCGGCGCC GACGCCGGCT GCAACGCCAA GGCGGAGGCC
GAGGCGAAGG GCAAGCCGCG CGTCGCGACC CGCCTCCTCT TCCGCCACTG GAACGCGTGG
CGCGAGCGGC TGCGCACCCA CGTGCTGAAG GTTCCCCTCG ACGGCGGGGC GCCCTCGGAC
CTCACGCCCG GCGATCGCGA CGCTCCGCCC GCGGTCCGCG GCGGCCCGGG CGACCTCGCG
GTCTCCCCCG ACGGGAAGAC CCTCTACTTC ACCGCCGTCG ACGACGCGCT CGAGGCGGCC
TCCACCAACG CGGACGTCTT CGCGGTCCCG CTCGCGGGCG GCGAGGCCCG CCGCGTTACG
AGCGGGCCGG GCTGGGACGC GAGCCCGCGC CCCTCCCCCG ACGGCAAGCG GCTCGCCTGG
CTGTCGCAGG CGCGCGGCGG CTACGAGTCC GACCGCGTCC GCGTGATGGT CGCCGGGATC
GACGGGAAGG ACGCCCGCGA TCTCACCGCG GGCGTGGATC TCTCGGCGAG CGACCTGCAC
TGGGCGCGGA AGGGCGGGGC GCTGCGCTTC GTCGCGCTCA CCTCCGGCTA CCACGAGGTC
TACGAGGTGG ACGTCCGGAG CGGGGAGCTC GTGAAGCTCG CCGGCGCGCC CGCCCTCGCA
GGCAGGCCGC GCGTGAACGT GCAGTCCGTC TCGTACTCGG CCGACGGGGC GCGCGTCGCG
GCGCTCGTGG ACGGGACCAC CGCGCCGCCG GAGATCGCGG TGCTCGAGGG CAAGCCGGGC
AAGGCCCGCT GGGCGGAGCG CACGCGCCTC GCGGCCGACG CCCTCGCGGG GATCGCGCGC
CCCACGCTGC GGCCCCTCGA GGCCACCTCC AAGGACGGCA CGAAGGTCTT CGGCTGGATC
GTCCTCCCGG CCGAGCACCG CGACGGCCAG CGTCACCCGG CGGCGGTGCT CGTCCACGGA
GGGCCGCAGG GCGCCTGGAA CGACGCCTGG ACCTCGCGCT GGAACGCCAT GCTGTACGCG
GCGCGCGGGT ACGCGGTGGT GCTGCCGAAC CCGCGCGGCT CGACCGGCTA CGGGCAGGCC
TACGTCGACG CCGTCTCGCG CGACTGGGGC GGCAAGCCTT ACGAGGACGT GATGGCGCTC
GTCGATGCCG CCATCGCGCT AGGCGCGGTG GACGGTGCGC GGATGTGCGC AGCGGGCGCG
AGCTACGGCG GCTACATGGT GCACTGGCTG AACGGCCAGA CGGAGCGCTT CCGCTGCCTC
GTGTCGCACG CGGGGATCTT CGACCTCGAG GCCTTCTTCT ATCGCACCGA GGAGCTGTGG
TTCCCAGAGT GGGAGTTCGG GGGGACGCCG TTCGACCAGC CGGCCGACTA CCAGAAGTTC
TCGCCGCACC GCTTCGTGCA GCGCTGGCGG ACGCCGACGC TCGTCTCGGT GGGCGAGCTC
GACTACCGCA CCACCGTCGA CCACGGCTAC GCCGCGTTCA CGGCGCTGCA GCGGCGCGGG
ATCCCTTCGA AGCTTCTCGT CTTCCCGGAC GAAGGACACT GGGTCTCGAA GCCGAAGAAT
GCGAGAGTGT TCTACGATGT TGTGCTCGGG TGGCTCGACG AGCACCTCGC CCCCGCGGCG
AACGTGTCCG CGAAGGCGCG CTGA
 
Protein sequence
MLTSALAALA VAAAAAAPRP FTVDDLVALP RVGAPELSPD GTLLAFTVGR LTGAGDRIVS 
TLYVAPAAGG ASRQLTDRDE RISSPRFSPD GKRLAFVSTR SGAPQAHVLD LASGQIRVAS
ALPGGANELS WTPDGAALLV TADVDPRCGA DAGCNAKAEA EAKGKPRVAT RLLFRHWNAW
RERLRTHVLK VPLDGGAPSD LTPGDRDAPP AVRGGPGDLA VSPDGKTLYF TAVDDALEAA
STNADVFAVP LAGGEARRVT SGPGWDASPR PSPDGKRLAW LSQARGGYES DRVRVMVAGI
DGKDARDLTA GVDLSASDLH WARKGGALRF VALTSGYHEV YEVDVRSGEL VKLAGAPALA
GRPRVNVQSV SYSADGARVA ALVDGTTAPP EIAVLEGKPG KARWAERTRL AADALAGIAR
PTLRPLEATS KDGTKVFGWI VLPAEHRDGQ RHPAAVLVHG GPQGAWNDAW TSRWNAMLYA
ARGYAVVLPN PRGSTGYGQA YVDAVSRDWG GKPYEDVMAL VDAAIALGAV DGARMCAAGA
SYGGYMVHWL NGQTERFRCL VSHAGIFDLE AFFYRTEELW FPEWEFGGTP FDQPADYQKF
SPHRFVQRWR TPTLVSVGEL DYRTTVDHGY AAFTALQRRG IPSKLLVFPD EGHWVSKPKN
ARVFYDVVLG WLDEHLAPAA NVSAKAR