Gene RPB_4054 details


Gene Information

Locus tag: RPB_4054
Symbol:
ID: 3911861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4624766
End bp: 4626169
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 70%
IMG OID: 637885958
Product: amidase
Protein accession: YP_487658
Protein GI: 86751162
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases
TIGRFAM ID: [TIGR02715] amidohydrolase, AtzE family
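
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and not part of any database tool.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 4624766
END_BP = 4626169


def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends with
    a single stop codon that contributes to the gene length only.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 1404 467
```

Under these assumptions the result agrees with the Gene Length (1404 bp) and Protein Length (467 aa) fields.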


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.200253
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCCG AAACATCCTC CGGCTGGATG ACCGCCGCCG AGATCGCGGC GGCCGTCGCC 
AACCGGACGA TGACCGCCCT CGACGCCACC GAAGCGGCGC TGGCGCGGAT CGCGCAGCGC
GACGCGACCC TGAATGCCTT CACCGACATC GTCGCCGAGC GCGCCCGCAA TCGCGCCCGC
GCGATCGATG CAGCGATCGC GCGCGGCGAA CAGGTCGGCC CGCTCGCCGG CGTGCCGTTC
GCGGTGAAGA ACCTGTTCAA TGTCGCGGGC CTGACGACCC GCGCCGGCTC CAAGATCAAT
CGCGACCTTG CCCCCGCGAA ACGCGATGCC ACGCTGATCG AGCGGCTGGA AGCCGCCGGC
GCGGTGCTGA TCGGCGCGCT CAACATGGGC GAATACGCCT ACGACTTCAC CGGCGAGAAT
TTTCACGACG GCCCGTCGCG CAATCCGCAC GACCCGACGC GGATGACCGG CGGCTCGTCC
GGCGGTTCGG GTGCCGCGGT CGGCGGCGGC GAGGTGCCGC TGGCGCTCGG CTCGGATACC
AATGGCTCGA TCCGGGTGCC GTCGTCGTTT TGCGGCATCT TCGGGCTGAA GCCGACCTAT
GGCCGGCTGC CGCGCTCGCG CTCGTTTCCG TTCGTCGCGA GCTTCGATCA CCTCGGCCCG
TTCGCGCGCA ACGTCGCAGA TCTCGCGCTC GCCTATGACG TGATGCAGGG GCCGGATTCC
GACGACGCCG CCTGCTCGAC GCGTTCGATC GAGCCGGTTC ACGCCGCGCT GGCGCAAGGC
CTCGACGGCC TGCGCATCGC GCGGGCCGGT GGGTACTTCG CGGCCAATCT GTTTCCCGAA
GCCCGTGAAG CCGTCGATCG CGTGGCGAAG GCGCTCAGTA TCACCATCAC CGTGGAACTG
CCCGAAGCCG CGCGCGCCCG CGCCGCGGCC TTCGTCATCA CCACTGTCGA GGGCGCGTCG
CTGCATCTCG ATCGTCTGCG CCAGCGGCCG AATGATTTCG ACCCCGCGGT GCGCGACCGG
CTGATTGCAG GGGCGATGAT CCCGGCGCCG CTGGTCGACC GCGCGCAGAA ATTTCGCCGC
TGGTATCGCG CGCGCGTGCT CGAGTTGTTC AGGGACGTGG ATGTCATCAT CGCGCCGGCG
ACCCCCTGCG TCGCGCCGAA GCTCGGGCAG CAGAGCTTCA TGCTCGACGG CGTGGAGCTG
CCGGTGCGCG CCAATATCGG CATCCACACC CAGCCGATCT CGTTCATCGG CCTCCCGGTC
GTGGCCGTGC CGATCCCGCT CGAGCCGATG CCGATCGGCA TCCAGATCAT CGCCGCGCCG
TGGCGCGAGG ACCTCGCGCT GCGCGTCGCG CATGCGCTCG AAACAGCCGG CGTCGCCTGC
GCGCCGCGCC CGCAAACCTT TTGA
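
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks have to be joined before any per-base statistic such as GC content is recomputed. The following is a minimal sketch of that step. The function name gc_fraction is hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene, so its result is not expected to match the GC content reported for the full record.

```python
def gc_fraction(blocked_seq: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked sequence."""
    # Remove spaces and line breaks between the 10-base blocks.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, for illustration only.
first_blocks = "ATGAAGTCCG AAACATCCTC"
print(gc_fraction(first_blocks))  # 0.45
```

Passing the complete gene sequence text to the same function gives the value to compare with the GC content field.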
 
Protein sequence
MKSETSSGWM TAAEIAAAVA NRTMTALDAT EAALARIAQR DATLNAFTDI VAERARNRAR 
AIDAAIARGE QVGPLAGVPF AVKNLFNVAG LTTRAGSKIN RDLAPAKRDA TLIERLEAAG
AVLIGALNMG EYAYDFTGEN FHDGPSRNPH DPTRMTGGSS GGSGAAVGGG EVPLALGSDT
NGSIRVPSSF CGIFGLKPTY GRLPRSRSFP FVASFDHLGP FARNVADLAL AYDVMQGPDS
DDAACSTRSI EPVHAALAQG LDGLRIARAG GYFAANLFPE AREAVDRVAK ALSITITVEL
PEAARARAAA FVITTVEGAS LHLDRLRQRP NDFDPAVRDR LIAGAMIPAP LVDRAQKFRR
WYRARVLELF RDVDVIIAPA TPCVAPKLGQ QSFMLDGVEL PVRANIGIHT QPISFIGLPV
VAVPIPLEPM PIGIQIIAAP WREDLALRVA HALETAGVAC APRPQTF