Gene AnaeK_3888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_3888 
Symbol 
ID6785355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp4390730 
End bp4392379 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID642765358 
Productpeptidase U34 dipeptidase 
Protein accessionYP_002136226 
Protein GI197124275 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.309446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGACC ACCGCAAGAC CGCCCCCGCC GCCGCGCTCG CCGCGGCAGC CCTCGTCCTC 
TCCCTGCCCG GCGCGGCCGA CGCCTGCACC AGCATCCTGG TCTCGAAGGG CGCGAGCGCG
GACGGCTCCA CCTTCATCAC CTACGCGGCC GACTCGCACG ACCTCTACGG CGACCTCCCG
CTCCGCCCGG CGGCGCAGCA CGCGCCCGGC GCACAGCGCG AGATCATCGA GTGGGACACC
GGCAAGTTCC TGGGCCGCAT CCCGCAGCCG GCCGTCACCT ACCACGTGGT CGGCAACATC
AACGAGCACC AGGTCGCCAT CGGCGAGACC ACCTTCACCG GCCGCAAGGA GCTGCAGGAT
CCCGAGGGCC GGGTGGACTA CGGCTCGCTC ATGTACATCG CGCTGGAGCG CGCCCGCACC
GCGCGCGAGG CGATCCAGGT GATGACCGAC CTCGTGGCCG AGTACGGCTA CGCCTCCACC
GGCGAGTCCT TCTCCATCTC GGATCCGAAC GAGGCCTGGA TCCTCGAGAT GATCGGCAAG
GGGCCGAAGC GGAAGGGCGC GGTCTGGGTG GCCCGCCGCA TCCCGGACGG CTACGTGTCG
GCGCACGCGA ACCACGCCCG CATCCGCCAG TTCCCGCTCG ACGAGCCGAA GACCACGCTC
TACGCGAAGG ACGTCATCTC GTTCGCCCGC GAGAAGGGCT GGTTCAAGGG CAAGGACGCC
GAGTTCAGCT TCGCCGACAC CTACGCGCCG CTCGACTTCG GCGCGCTGCG CGCCTGCGAC
GCGCGGGTGT GGAGCGTGTT CCGCCGCGTG GCGCCGGGGC AGTCGCTGCC GTCCTCCATG
GTGAAGGGAC AGGACCCGAA GGCCGAGCGC GTGCCGCTGT GGGTGAAGGC CGAGAAGCCG
CTCGCGGTGC GCGACGTGAT GGCGCTCATG CGCGACCACT TCGAGGGCAC CGAGCTCGAC
CTGTCGAAGG GCGTGGGCGC GGGCCCGTTC TCGGTGCCGT ACCGCTGGCG GCCCATGACG
TTCAAGGTGG ACGACCAGGA GTACCTGAAC GAGCGGGCCA TCTCGACGCA GCAGACCGGC
TTCTCGTTCG TGGCGCAGTC GCGCGCCGCG CTGCCGGCGG CGGTGGGCGG GGTGCTCTGG
TTCGGCGTGG ACGACACGTA CAGCACCGTC TACGTGCCGA TGTACTGCTC GATCCACGAG
GTGCCGCGCA GCTTCGCGGT GGGCACCGCC GACTTCAAGA CGTTCAGCTG GGACTCGGCG
TTCTGGGTGT TCAACTTCGT GTCGAACTGG GCCTACTCGC GCTACTCGGA CATGATCCAG
GACGTGCAGC AGGTGCAGGG CGAGCTGGAG GGCGGGTTCC TCTCGCGGCA GGCGGAGCTG
GAGAAGGCCG CCCTGACGCT CTACAAGGAC TCGCCCGGCC TGGCCCGCGA CTACCTCACC
CGCTACTCGG TCAGCCAGGG CGACATGGTC ACGGCGCGCT GGCGCAAGCT GGGCGAGTCG
CTGATGGTGA AGTACCTCGA CGGCAACGTG CGCGACGCGC AGGGCAACGT CACGCACCCG
GACTACCCGG AGGCCTGGCG CCGGCGCATC GCGGCCGAGG ACGACGGCAT CCTGCGGGTG
CCGAAGGAGC CGCAGAAGGT GGCGCAGTAG
 
Protein sequence
MLDHRKTAPA AALAAAALVL SLPGAADACT SILVSKGASA DGSTFITYAA DSHDLYGDLP 
LRPAAQHAPG AQREIIEWDT GKFLGRIPQP AVTYHVVGNI NEHQVAIGET TFTGRKELQD
PEGRVDYGSL MYIALERART AREAIQVMTD LVAEYGYAST GESFSISDPN EAWILEMIGK
GPKRKGAVWV ARRIPDGYVS AHANHARIRQ FPLDEPKTTL YAKDVISFAR EKGWFKGKDA
EFSFADTYAP LDFGALRACD ARVWSVFRRV APGQSLPSSM VKGQDPKAER VPLWVKAEKP
LAVRDVMALM RDHFEGTELD LSKGVGAGPF SVPYRWRPMT FKVDDQEYLN ERAISTQQTG
FSFVAQSRAA LPAAVGGVLW FGVDDTYSTV YVPMYCSIHE VPRSFAVGTA DFKTFSWDSA
FWVFNFVSNW AYSRYSDMIQ DVQQVQGELE GGFLSRQAEL EKAALTLYKD SPGLARDYLT
RYSVSQGDMV TARWRKLGES LMVKYLDGNV RDAQGNVTHP DYPEAWRRRI AAEDDGILRV
PKEPQKVAQ