Gene Rru_A2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2871 
Symbol 
ID3836311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3307070 
End bp3308245 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content68% 
IMG OID637826982 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_427955 
Protein GI83594203 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCCG CCATCCCGCC CGCCATCGCG GCGCTGACCG GTGATATGAA AGCGTGGCGT 
CATCATCTGC ACGCCCATCC CGAAACCGCC TTCGAAGAGC ACGCCACCGC CGATTTCATC
GCCGGGCTGC TCGACGACTT CGGGGTCGAG GTCCATCGCG GGCTGGCCGG AACCGGGGTG
GTCGGGGTGA TCGCCGGCAA ACGGACGGGA AACCGCGCGA TCGGGTTGCG CGCCGATATC
GACGCCCTGC ACGTCACCGA GGCCACCGGC CTGCCCCACG CCTCGGTCCA TGCCGGGCGC
ATGCACGCCT GCGGCCATGA CGGCCACACG GCGATGCTGC TGGGAGCGGC CAAGCATCTG
GCCGCGACCC GCGATTTCGC CGGCAGGCTG ATCCTCATCT TCCAGCCCGC CGAGGAAAAC
GAGGGCGGCG GCAAGGTGAT GGTCGAAGAG GGCTTGTTCG ACCGGTTCCC CGTGGATGCG
GTCTATGGCA TGCACAACTG GCCGGGGCTG GAGGAAGGCC ACTTCGCCCT GCGCACCGGT
CCGATCATGG CCGGCTATGA CGTGTTCGAG ATCACGCTTA CCGGCAAGGG GGGCCATGCC
GCCATGCCCC ATCTCGGCAC CGATCAGTTG GTGGCGGCCG GGCATCTGAT GACCGCCTTG
CAGTCGATCG TCGCCCGCTC GGTCAATCCG ACCGAGGCGG CGGTGGTGTC GGTCACCCAG
ATGCACGGCG GCGACACCTG GAACGTCCTG CCCGCCAGCG TCGTGCTGCG TGGCACCGTG
CGCACCTTCA CCAAAGCCGT GCAGGATCTG ATCGAGACGC GGATCACCGA GCTGTCGCGA
TCGATCGCCC AGGGCTTTGG CGCCGAGGCG GCGATCCATT ACGAGCGGCG CTATCCCGCC
ACCGTCAACA GCCCCGAGGA AGCCGCCGTC GCCGCCCGCG TGGCCAGCGC CGTGGTCGGC
GCCGACAAGG TGGACACCAA TTGCCCGCAG ACCATGGGGG CGGAGGATTT CGCCTTCATG
CTGGGGGTCA AGCCGGGCGC CTATGTGCAG CTTGGCGCCG GCCCGGGGCG GGGCGGTTGC
ATGCTCCACA ACCCCGGTTA CGACTTCAAC GACGCCCTTC TGGGCGTAGG GGCGAGCTAT
TGGGTGGGGC TGGTCCACGA CCAACTGGCC GGCTAG
 
Protein sequence
MTPAIPPAIA ALTGDMKAWR HHLHAHPETA FEEHATADFI AGLLDDFGVE VHRGLAGTGV 
VGVIAGKRTG NRAIGLRADI DALHVTEATG LPHASVHAGR MHACGHDGHT AMLLGAAKHL
AATRDFAGRL ILIFQPAEEN EGGGKVMVEE GLFDRFPVDA VYGMHNWPGL EEGHFALRTG
PIMAGYDVFE ITLTGKGGHA AMPHLGTDQL VAAGHLMTAL QSIVARSVNP TEAAVVSVTQ
MHGGDTWNVL PASVVLRGTV RTFTKAVQDL IETRITELSR SIAQGFGAEA AIHYERRYPA
TVNSPEEAAV AARVASAVVG ADKVDTNCPQ TMGAEDFAFM LGVKPGAYVQ LGAGPGRGGC
MLHNPGYDFN DALLGVGASY WVGLVHDQLA G