Gene RPB_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3620 
Symbol 
ID3911422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4155106 
End bp4156356 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content64% 
IMG OID637885522 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_487226 
Protein GI86750730 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.963637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.598514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA ATCGCCGTCA TCTCGTGATC GGAGGCGCCG CCAGCGCCGC CGTCCTGCCG 
TTCGGAACTG CCGCGCGCGC GCAGGCCAAG GAAGTCGTGA TCGGCGTGAT CTATCCGCTG
TCCGGCGCCA GCGCCCAGAT CGGCGTCGAC GCCCAGAAGG CGTTTCAGAC CGCGGCCGAA
CTGATCAACA ACAAATACGA TTTCGACCTG CCGCTGGCCC GTGACGAGGG CCTTCCCGGT
CTCGGCGGCG CCAAGGTGCG GCTGGTGTTC GCCGATCACC AAGCCGACCC GCAGAAGGGC
CGCGCCGAAG CCGAGCGCCT GATCACGCAG GAAAAGGTCA GCGCCATCGT CGGCACCTAT
CAGAGCGCGG TCGCCGTCAC CGTCAGCCAG ATCTGCGAGC GCTACCAGGT TCCCTTCCTG
TCGGCCGACA ATTCGTCGCC GAGCCTGCAT CGTCGCGGCC TCAAATACTA TTTCCGCGCC
GCGCCGCACG ACGAGATGTT CTCGCAGGCG ATGTTCGACT TCTTCGATGC GCTGAAGAAG
AAGGGCAAGA AGATCGAGAC GCTGGCGCTG TTCCACGAGG ACACCATTTT CGGCACCGAC
TCGTCGAACG CCCAGCTCAA GCTCGCCAAG GACCGTGGCT ACAAGATCGT CGCCGATATC
AAGTATCGCG CCAACTCGCC GTCGCTGACC GCCGAGGTGC AGCAACTCAA GGCGGCCGAC
GCCGACGTGC TGATGCCGTC GAGCTACACC ACCGACGGCA TTCTGCTGAT CCGCACCATG
GGCGAACTCG GCTACAAGGC CAAGAACATC GTCGCGCAGG ACGCGGGCTT CTCGGAGAAG
GCGCTGTACG ACGCGGTCGG CGACAAGATC CCGGGCGTGA TCTCGCGCGG CTCGTTCTCG
CTCGACCTCG CCGCCAAGCG GCCGATGGTC GGCAAGATCA ACGACATGTA CAAGGAGCGC
TCCGGCAAGG ACTTCAACGA CTACTCGTCG CGGCAGTTCA TGGGCCTGAT CGTGATGGCC
GACGCCATCA ACCGCGCCAA ATCGACCGAC GGCGAGAAGA TCCGCGAGGC GCTGGTCGCC
ACCGACATGC CGGGCGAAAA GACCATCATG CCGTGGAAGC AGGTGAAGTT CGACGCCGAA
GGCCAGAACA CCTTCGCCGA CCCGGTGTTG CTGCAATATG TCGGCGGCAA GTTCGTGACG
ATCTTCCCGG AACAGGCCGC AGTCGCCGAG GCGATCTGGC CGATGCCGTA A
 
Protein sequence
MKINRRHLVI GGAASAAVLP FGTAARAQAK EVVIGVIYPL SGASAQIGVD AQKAFQTAAE 
LINNKYDFDL PLARDEGLPG LGGAKVRLVF ADHQADPQKG RAEAERLITQ EKVSAIVGTY
QSAVAVTVSQ ICERYQVPFL SADNSSPSLH RRGLKYYFRA APHDEMFSQA MFDFFDALKK
KGKKIETLAL FHEDTIFGTD SSNAQLKLAK DRGYKIVADI KYRANSPSLT AEVQQLKAAD
ADVLMPSSYT TDGILLIRTM GELGYKAKNI VAQDAGFSEK ALYDAVGDKI PGVISRGSFS
LDLAAKRPMV GKINDMYKER SGKDFNDYSS RQFMGLIVMA DAINRAKSTD GEKIREALVA
TDMPGEKTIM PWKQVKFDAE GQNTFADPVL LQYVGGKFVT IFPEQAAVAE AIWPMP