Gene RPB_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1978 
Symbol 
ID3909483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2247164 
End bp2248372 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID637883872 
Productextensin-like protein 
Protein accessionYP_485597 
Protein GI86749101 
COG category[S] Function unknown 
COG ID[COG3921] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.259223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCG GAGTTCGTTT GTATCTCGTC GGCTCCTTCG TCCTCGTGTC GCTTGCCGGT 
TGCGGACGCG GTTTGTTTCA AACCGCCGAG CGTGAACCGT GGCGGACCGA GGCCGAGGTC
GCTTGTCTGA AATCGGGCGC CGTCAAGGAA GGCCCCGAAC TGGTCCGGGT CGATCCGATC
TCGGGCCCCG GCGTCTGCGG CGCCGAGTTT CCCCTCAAAG TGGCTGCGCT CGGCGAGAGT
GGCGCGATCG GCTTTGCCGA CGATTTGCGT CCGCCAGCGG CGATCGGAGG TCGCGCCAGC
CAGCCGCGCT GGCCGGGGGC GCAGCCGTCT TATGCGGCGC CGGCGCGCGG CTATCCGCAA
CAGCAAACCG GCTACGGCGC TTCGAATCCT CCCTACGGCA GCAACAATGC TCCGGTGTCG
CTGACCGCGC CCGGCGTCGG CCCCGCGGGC CGGGACATCG ATCTGCCGGA CGAGGGCGCG
CTGCCGCCTG CGGATCGTCC GCCGGCCGAG CACGTCACCG GCTATTCGCG CGATCCGAGC
TACGCACCGG CGCCCGCCGG TCGTGCGCCG GACGACGCGC GGCGCCCATT GCCGCGGCTC
GGTCCGGCGC AGCAGGGCAA CATCACCGGC TCGGTCGGGC CGGTCGCGAT CAAGCCGGTG
GCGACGCTGG CGTGTCCGAT CGTCTCCGCG CTGGACCGCT GGCTGGTGGA ATCGGTGCAG
CCGTCGGCGA TGCGCTGGTT CGGTGTCCCC GTCGTCGAAA TCAAGCAGAT CTCGGCCTAT
TCGTGCCGCG GCATGAACGG CAATCCGAAC GCGCACATCT CCGAACACGC CTTCGGCAAC
GCGCTCGACA TCTCCGCCTT CGTGCTGGCC GACGGCCGGC GCGTGACGGT GAAGGGCGGC
TGGAAGGGAT TGCCGGAAGA GCAGGCGTTC CTGCACGACG TGCAGAACTC GGCGTGCCAA
ATGTTCAACA CCGTGCTGGC GCCGGGCTCG AACATCTATC ACTACGATCA CATCCACGTC
GACCTGATGC GCCGCAAGAG CCAGCGCAGC ATCTGCAAGC CCGCCGCGGT GCCGGGCGAA
GTGATCGCGC AGCGGCTGCA GGGGCGCAAT CCCTATGCGT CGGGCAATTG GAACGGCGTC
ACCGGCTCGA TCGGCAAAGT CCCGGCGCGT GCGAAGGCGG TGGATCGCGA CGAAGCCGAA
GACGATTAG
 
Protein sequence
MTRGVRLYLV GSFVLVSLAG CGRGLFQTAE REPWRTEAEV ACLKSGAVKE GPELVRVDPI 
SGPGVCGAEF PLKVAALGES GAIGFADDLR PPAAIGGRAS QPRWPGAQPS YAAPARGYPQ
QQTGYGASNP PYGSNNAPVS LTAPGVGPAG RDIDLPDEGA LPPADRPPAE HVTGYSRDPS
YAPAPAGRAP DDARRPLPRL GPAQQGNITG SVGPVAIKPV ATLACPIVSA LDRWLVESVQ
PSAMRWFGVP VVEIKQISAY SCRGMNGNPN AHISEHAFGN ALDISAFVLA DGRRVTVKGG
WKGLPEEQAF LHDVQNSACQ MFNTVLAPGS NIYHYDHIHV DLMRRKSQRS ICKPAAVPGE
VIAQRLQGRN PYASGNWNGV TGSIGKVPAR AKAVDRDEAE DD