Gene Adeh_4020 details

Gene Information

Locus tag: Adeh_4020
Symbol:
ID: 3888944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter dehalogenans 2CP-C
Kingdom: Bacteria
Replicon accession: NC_007760
Strand:
Start bp: 4630073
End bp: 4631443
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 70%
IMG OID: 637865578
Product: amino acid/peptide transporter
Protein accession: YP_467221
Protein GI: 86160436
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
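
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the listed gene sequence ends in TAG). The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4630073
END_BP = 4631443
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 4631443 - 4630073 + 1 = 1371 bp, and 1371 / 3 - 1 = 456 aa.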


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.319297
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCGT CCGCGCCCGA GCGCTACCCG CCCCAGGTCA AGTACATCGT CGGCAACGAG 
GCCTGCGAGC GCTTCTCGTT CTACGGGACG AGCTCGATCC TCACCGTCTA CATGCTGCAG
CACCTGCTGT ACGAGGCGCA GGACGCGAAG GCCTACTACC ACTACTTCGT GATGGCGACC
TACCTCACGC CGCTGGTGGG CGGGTGGATC GCCGACCGGT ACCTGGGGCG GTACCGGACC
ATCCTCTGGA TCTCGCTCTT CTACGTGCTC GGCCACGGCG TGCTCGCCGC CTGGGAGACG
CGCACCGGCT TCTTCGTGGG GCTCTGCCTC ATCGCCGCCG GCGCCGGCGG CATCAAGCCG
TGCGTGTCGG CGTTCGTGGG CGACCAGTTC CGGCCCGAGC AGCACGGCCT GCTCCAGCGC
GTGTACGGCT GGTTCTACTG GTCCATCAAC CTCGGCTCGG CGAGCGCGAA GCTCCTCATC
CCGCTCCTGC TCCTCACCGT CGGCCCGTCG GTGGCGTTCG CGCTGCCAGG CGTCCTCATG
GCCGTGGCGC TGGTGGTGTT CTGGCTCGGC CGCAAGCACT ACGCGTTCGC GCCGCCCTCC
GGGCCGAACC CGCACGGCCT GTTCCGCGTG GTCGGCTTCG CGCTCTCGCG CCTCGGCACC
GGCAAGCCCG GCCAGCACTG GCTCGACGCC GCGCGCGAGC GCCATCCGCA GGAGGCGGTG
GACGGGGCGA AGGCGGTGTT CCGCATCATG GGCGTGTACG CGGCGGTGAC GCTGTTCTGG
GCGCTCTACG ACCAGAAGGG CTCGAGCTGG GTGCTCCAGG CGAAGGACAT GGACCTGCAC
CTCGGCACGC TCACGCTCTC GCCGGCGCAG CTGCAGGCGC TGAACCCGTT CATGGTGATG
GCGCTCATCC CGCTGTTCAA CTGGGTGATC TTCCCGGCGC TGGAGCGGCG CGGGATCGCG
CGCACGCCGC TCTCGCGCAT GACCGGCGGC ATGTTCCTCA CCGTGCTGTC GTTCGCGGCC
GCGGCGGTGG TGCAGACGCT CATCGACGCC GGCCACGCGC CGCACGCGCT CTGGCAGCTC
CCGCAGTACC TGCTGCTCAC CACCGGCGAG GTGCTGGTCT CGGTGACCGG GCTCGAGTTC
AGCTACACGC AGGCGCCGCG CTCCATGCGC AGCACCATCA TGTCGCTCTG GTTCCTCACC
ATCGCGCTCG GGAACCTGCT CACCGCGCTC GTCACCGAGC TCGTGCCGCT CTCCGGCGCC
GCGTACTTCT GGGCGTTCGC CGCGCTCATG CTGGCGGCCG CGTTCGCGTT CCAGGCCATC
GCCCGGCGCT ACCGCCCGGT GGCGCTGCCG TCCGCGGCGG TGGCCGAGTA G
 
Protein sequence
MSASAPERYP PQVKYIVGNE ACERFSFYGT SSILTVYMLQ HLLYEAQDAK AYYHYFVMAT 
YLTPLVGGWI ADRYLGRYRT ILWISLFYVL GHGVLAAWET RTGFFVGLCL IAAGAGGIKP
CVSAFVGDQF RPEQHGLLQR VYGWFYWSIN LGSASAKLLI PLLLLTVGPS VAFALPGVLM
AVALVVFWLG RKHYAFAPPS GPNPHGLFRV VGFALSRLGT GKPGQHWLDA ARERHPQEAV
DGAKAVFRIM GVYAAVTLFW ALYDQKGSSW VLQAKDMDLH LGTLTLSPAQ LQALNPFMVM
ALIPLFNWVI FPALERRGIA RTPLSRMTGG MFLTVLSFAA AAVVQTLIDA GHAPHALWQL
PQYLLLTTGE VLVSVTGLEF SYTQAPRSMR STIMSLWFLT IALGNLLTAL VTELVPLSGA
AYFWAFAALM LAAAFAFQAI ARRYRPVALP SAAVAE
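
The sequences above are listed in blocks of 10 characters, 60 per row, so the spaces and line breaks need to be removed before the sequence is measured or analyzed. The sketch below shows one way to do that and to recompute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is pasted here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as listed above; paste the full
# listing instead to check the whole gene.
first_row = "ATGTCCGCGT CCGCGCCCGA GCGCTACCCG CCCCAGGTCA AGTACATCGT CGGCAACGAG"
print(len(clean_sequence(first_row)), "bases in the first row")
print(round(gc_percent(first_row), 1), "percent GC in the first row")
```

Applying `clean_sequence` to the full gene and protein listings and taking `len` of the results gives a quick check against the Gene Length and Protein Length fields.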