Gene RPB_4331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4331 
Symbol 
ID3912144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4915232 
End bp4916179 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content64% 
IMG OID637886235 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_487929 
Protein GI86751433 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0888724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCA GATTTTTGGG TTTCGCCGCG GCTGCGGCCG TGCTCGTTTC GGCGCCGCAG 
GCTCACGCCC AGCAGTTCGT CAACGTGCTG ACCGGCGGCA CGTCCGGCGT GTATTATCCG
CTCGGCGTCG CGATCGCGAA GATCTACGGC GACAAGATTC CGAACGTGAA GTCGCAGGTG
CAGGCCACCA AGGCGTCGGT CGAGAACCTC AATTTGCTGC AGCAGGGCCG CGGCGAGATC
GCCTTCACGC TCGGCGACTC GCTGAAAGCG GCGTGGGACG GCGATCCTGA GGCCGGCTTC
AAGGCCAAGC TCGACAAGCT GCGCGTGATC GGCGCGATCT ATCCGAACTA CATCCAGATC
GTCGCCACCG CGGAGTCGGG GATCAAGACG CTCGCCGACC TCAAGGGCAA GAGCCTGTCG
GTCGGCGCGC CGAAATCCGG CACCGAGCTG AATTCCCGCG CCATCCTCAA GGCCGCCGGG
ATGGATTACA AGGACATGGG CAAGATCGAA TATCTGCCGT TCGCCGAATC CGTCGACCTG
ATGAAGAACC GCCAGCTCGC CGCCACGCTG CAATCCGCAG GCCTCGGCGT CGCCTCGCTC
AAGGATCTCA GCAACTCCTC CGAGATCAAC GTGGTCTCGG TGCCGAAGGA CGTGGTCGAC
AAGATCGGCC CGCCGTTCGT CGCCGAAACG ATCCCGGCCG GCACCTACAA GGGCCAGGAC
AAGGACGTTC CGACCGCGGC GGTGATCAAC TATCTCGTCA CTTCGACCGC GGTGTCCGAC
GATCTCGCCT ATCAGATGAC CAAGCTGGTG TTCGACTCGC TGCCGGACCT CGCCAGCGCC
CACGCCGCCG GCAAGGGCAT CAAGCTCGAG ACCGCCGCGG CCGGCAGCCC GGTTCCGCTG
CACCCCGGCG CGATCAAGTA CTTCAAGGAA AAGGGCGTGC TGAAGTAA
 
Protein sequence
MKARFLGFAA AAAVLVSAPQ AHAQQFVNVL TGGTSGVYYP LGVAIAKIYG DKIPNVKSQV 
QATKASVENL NLLQQGRGEI AFTLGDSLKA AWDGDPEAGF KAKLDKLRVI GAIYPNYIQI
VATAESGIKT LADLKGKSLS VGAPKSGTEL NSRAILKAAG MDYKDMGKIE YLPFAESVDL
MKNRQLAATL QSAGLGVASL KDLSNSSEIN VVSVPKDVVD KIGPPFVAET IPAGTYKGQD
KDVPTAAVIN YLVTSTAVSD DLAYQMTKLV FDSLPDLASA HAAGKGIKLE TAAAGSPVPL
HPGAIKYFKE KGVLK