Gene RPB_4215 details


Gene Information

Locus tag: RPB_4215
Symbol:
ID: 3912023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4787696
End bp: 4788799
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 66%
IMG OID: 637886118
Product: ABC transporter related
Protein accession: YP_487817
Protein GI: 86751321
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
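
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4787696
end_bp = 4788799
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is assumed to be a stop.
codons = span // 3
remainder = span % 3

print(span, remainder, codons - 1)  # 1104 0 367
```

Under these assumptions the 1104 bp span holds 368 codons, which gives 367 amino acids once the stop codon is excluded, in agreement with the listed protein length.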


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.611478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCA TTGACCTCGT GGACCTGGCT CACTCCTACC TCGTCGGCGA CGATCTGCCG 
CCGGCTGCCT ATGCGCTGAA GCCGGTATCG ATGACCTGGC GGCAGGGCGG CGCCTATGCA
CTCCTCGGCC CCTCCGGCTG CGGCAAGACC ACGCTGCTCA ATCTGATTTC CGGCATCGTG
ACGCCGTCGC GCGGCAAGAT CCTGTTCGAC GGCACCGACG TCACGCGGCT GTCGACCCGC
GAGCGCAACA TCGCGCAGGT GTTCCAGTTT CCGGTGATCT ACGACACCAT GACGGTGCGG
GAGAATTTGG CGTTTCCGCT GAAGAATCGC GGCGTGCCGA AGCCTGAGAT CGACAGGCGC
GTCGCCGAGA TCGCCGATCT GCTCGACCTC ACGCCGAATC TGGGGCGCAA GGCGACGCGG
CTGACCGCCG ACGCCAAGCA GAAGATCTCA CTCGGCCGCG GCCTGGTCCG CTCCGACGTC
GCCGCGATCC TGTTCGACGA ACCGCTCACG GTGATCGATC CGCATCTGAA GTGGGAGTTG
CGCTCCAAGC TGAAGGCGCT GCATCGCGCG CTGGATCTCA CGATGATCTA CGTCACCCAC
GACCAGACCG AAGCGCTGAC CTTCGCCGAC ACCGTCGTCG TCATGCATGA CGGCCGTGTG
GTGCAAAGCG GCACGCCGGA GGAACTGTTC GAGAAGCCGG CGCACACCTT CGTCGGTTAC
TTCATCGGCT CGCCCGGCAT GAACATCGTG CCGGCGCAGA TCCGCGGCCG CGAGGCGCTG
ATCGACGGCC ATGCGATCAC ACTCGCCCGC GGCTACGACA ATCTGCCATC CGGGGCCAAG
ATCGAGATCG GGGTGCGGCC GGAATTCGTG CACCTCACCG CGAAGGCGCC GGGGTTTTTG
TCCGGCCGCA TCGAGCGGAT CGACGACCTC GGCCGCATCC GTTTCGCCTG GGTGCGGGTC
GGCGGCGTCC GCTTCGCCGC GCGGGTCCCG GACGGATTCT CCGCCGACGG CGACGAGGTC
GGTCTGATGA TCGAACCGTC GCGCGTCCAC GTCTATGCCG ACAGCGAGAT CGTCGAAGGA
AGCGCGCTGG AGCAGGTCGC CTGA
 
Protein sequence
MARIDLVDLA HSYLVGDDLP PAAYALKPVS MTWRQGGAYA LLGPSGCGKT TLLNLISGIV 
TPSRGKILFD GTDVTRLSTR ERNIAQVFQF PVIYDTMTVR ENLAFPLKNR GVPKPEIDRR
VAEIADLLDL TPNLGRKATR LTADAKQKIS LGRGLVRSDV AAILFDEPLT VIDPHLKWEL
RSKLKALHRA LDLTMIYVTH DQTEALTFAD TVVVMHDGRV VQSGTPEELF EKPAHTFVGY
FIGSPGMNIV PAQIRGREAL IDGHAITLAR GYDNLPSGAK IEIGVRPEFV HLTAKAPGFL
SGRIERIDDL GRIRFAWVRV GGVRFAARVP DGFSADGDEV GLMIEPSRVH VYADSEIVEG
SALEQVA
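
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. What follows is a hedged sketch rather than the method used to produce this record. It uses the standard amino acid assignments, which translation table 11 shares, and does not handle alternative start codons. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCGCGCA TTGACCTCGT GGACCTGGCT"
print(translate(first_blocks))  # MARIDLVDLA
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison for the whole record.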