Gene RPB_2967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2967 
Symbol 
ID3910766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3378739 
End bp3379959 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID637884873 
Productaminotransferase 
Protein accessionYP_486580 
Protein GI86750084 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAT TTTACCGCAT CCGCCGCCTG CCGCCTTACG TGTTCGAACA GGTCAACCGG 
GCCAAGGCGG CCGCGCGGAA TGCCGGGGCC GACATCATCG ACATGGGGAT GGGCAACCCG
GACCTGCCGG CGCCGCCGCA CGTGCTGGAG AAGCTCAAGG AGACGCTCGG CAAGCCGCGC
ACCGACCGCT ATTCCGCCTC GCGCGGCATC ACCGGGCTGC GCAAGGCCCA GGCGGCCTAT
TACGACCGCC GGTTCGGGGT GAAGCTGAAT CCCGACACCC AGGTGGTCGC CACGCTCGGC
TCCAAGGAAG GCTTCGCCAA CGTCGCCCAG GCGATCACCG CGCCAGGCGA CGTCGTGCTG
TGCCCGAATC CGAGCTACCC GATCCACGCC TTCGGCTTCC TGATGGCCGG CGGCGTGATC
CGCTCGGTGC CGTCCGAGCC GACCCCGGAA TTCTTCGCCG CGGCCGAGCG CGCCATCATC
CATTCGATCC CGAAGCCGAT CGCGCTGATC GCCTGCTATC CGTCGAATCC GACCGCCTAT
GTGGCCAGCC TCGATTTCTA CAAGGATCTG GTGGCGTTCG CGAAGAAGCA CGAGATCATG
ATCCTGTCGG ATCTGGCCTA TGCCGAAGTC TATTTCGACG ATGCCAACCC GCCGCCGTCG
GTGCTGCAGG TGCCGGGCGC GATGGACGTC ACCGTCGAGT TCACCTCGAT GTCGAAGACG
TTCTCGATGG CCGGCTGGCG GATGGGCTTT GCGGTCGGCA ACGAGCGCAT CATCGCCGCC
CTGGCGCGGG TGAAGTCGTA TCTCGATTAC GGTGCGTTCA CGCCGGTGCA GGTCGCCGCC
ACCGCGGCGC TGAACGGCCC GGACGATTGT ATCCGCGAGA TGCGCGAGAC CTACAAGAAG
CGCCGCGACG CGCTGGTCGA GAGCTTCGGC CGGGCCGGCT GGGAGATCCC GCCGCCGTCG
GCCTCGATGT TCGCCTGGGC GCCGCTGCCG CCGGCATTTC GCGAGATCGG CAGCATGCAG
TTCGCCACCC TGATGGTGGA GAAATCCGGC GTCGTGGTGT CGCCCGGCGT CGCCTTCGGC
GAGCACGGCG AAGGCTATGT TCGCATCGCC ATGGTGGAAA ACGAGCAGCG CATCCGCCAG
GCGGCGCGCG GCGTCCGGCG CTTCCTTGAA AGCGGCGTCG AAACGTTGCA CAACGTGGTG
CCTCTCGCCA CCCGGCGATA G
 
Protein sequence
MEEFYRIRRL PPYVFEQVNR AKAAARNAGA DIIDMGMGNP DLPAPPHVLE KLKETLGKPR 
TDRYSASRGI TGLRKAQAAY YDRRFGVKLN PDTQVVATLG SKEGFANVAQ AITAPGDVVL
CPNPSYPIHA FGFLMAGGVI RSVPSEPTPE FFAAAERAII HSIPKPIALI ACYPSNPTAY
VASLDFYKDL VAFAKKHEIM ILSDLAYAEV YFDDANPPPS VLQVPGAMDV TVEFTSMSKT
FSMAGWRMGF AVGNERIIAA LARVKSYLDY GAFTPVQVAA TAALNGPDDC IREMRETYKK
RRDALVESFG RAGWEIPPPS ASMFAWAPLP PAFREIGSMQ FATLMVEKSG VVVSPGVAFG
EHGEGYVRIA MVENEQRIRQ AARGVRRFLE SGVETLHNVV PLATRR