Gene RPD_1954 details

Gene Information

Locus tag: RPD_1954
Symbol:
ID: 4022436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2193043
End bp: 2195064
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 64%
IMG OID: 637962147
Product: ATPase
Protein accession: YP_569090
Protein GI: 91976431
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
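
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2193043
end_bp = 2195064

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon that does not contribute an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 2022 0 673
```

Under these assumptions the computed values agree with the reported Gene Length (2022 bp) and Protein Length (673 aa).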


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.342143
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.310254
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC TCGGCATCGC CATCTACGAC CCGCGTCGGC TCGATCCCGA AACCTTCCTC 
AAAGGTTTCG TGGCGCGGGG CGATTTCGTC GACTTCCTGC TCGACAAGCT TCGCCAGATG
CCGGAGATCG GCGAGCATTT TCTGATCGTC GGCCCGCGCG GCATTGGCAA GACCAGCCTG
CTGCGGCGTC TCGCGATCGG GATCTCGGAG GAGCCCGCCC TGCGCGCGCG CTTCATTCCG
CTGAGTTTTC GCGAGGAGCA GTACAATGTC CGCGCGCTCG ATGCGTTCTG GAGAAACTGC
GCCGAGTCGC TCGCCGAATG GTGCGAGGAT CAAGGCCGGC AGGAGATCGC CGACGATATC
GACCGCAGCC TGCCGAGCGC GGAATGGCAT GAGTCGAACT CCGCGGTTCA GGCTTTCCTC
GCTCTCTGCA AGCGGCTCGG CGGCCGGCCG GTGCTGTTCG TCGACAATCT CGATCTGATC
CTCGACGCGC TGTCGCCGCA GCAGAACTGG GAGCTGCGGC GCACGCTGCA GGCACCCGGC
GGACCGATCG TATTCGGCGC CGCCACGCAG ATGCTGCGGC AGAGCGCCGA CCGCGACGCG
GCGTTCTACG AGTTCTTTCA CCCGCACATG TTGCATCCGC TCTCCGAAAG CGAGTTGCGG
CACTGCATGT CCCGGCTGGC GCAGGCGCGC GGCGATTTGG GCAAGCCGGT CCTCGATGTG
CTCGACCGCG AGCCGGAACG CATCCGCACG CTGCACAATC TGACCGGCGG CAATCCGCGG
GTGCTGACGC TGGTCTATCA ATTGCTCGAG CGCACCGAGA GCGACAGCGT GTTCCGCGAT
CTCGAAGTGC TGCTCGATCA GTTGACGCCC TATTACAAGG CGCGGGTCGA GGATTACCAG
ACCGATCTGC AGCGCGCGGT GATCGACGCC ATCGCACTGC ACTGGCATCC GATCACGTCG
AGCCGCCTGA GCGACATCAC CGCGGTCGAG GTCACCACGA TCTCGTCGCA GCTCAATCGG
CTGAAGAATG ACGGGCTGAT CGAGGAGGTC GAGACTTCCG GCGCGCGCGC CGGCTATCAG
CTCGTCGAGC GGTTCTTCAA CATCTGGTAT TTGATGCGAC ACGGCACCCG CCGTACCCGG
CAGAAGATCG CCTGGCTGAC GGAGTTCTTG AGCAGTTTCT ATGCACCGGC CGAGCTGATG
AAGATGAAGG CTGAGCTGAT CGCCGGCGGG AGCGCGTCGC TGCATCCGCT CTATCGCGAG
GCGCTGGAGG CGGCCGGTGA GGAGAGTGGG AGGTTGGCGC GGGCTGCAGT GCCAGAGCCC
TCTGTCGAGA CATCGCAATT GGGGAGGTCG AGCGATCTGT TTAGAGAGGC GGAGCACATT
GTCCGCCATT GGGTCGAGCG CGATCAGACG AATTATGACG GTTGGTCCTT GCTCGGTAGT
ATCCTTGCAG ACCATTCTGG ACATCCCGCC GAAGCTGAGG CTGCTTATCG GAAGGCAATG
ACGATCTCCG GCGATCGAGT GATCGCAGAG GCCAATCTGG CGTGGCTACT TTTCGCCTCG
GGTCGGTTGT CGGAAGCAGC CTCGCTCGAA TCCGCGCTGA CCAAGCTCGA TCCGGTCGGC
CGCGCGCTGC TCGATGCGGC GCGCGCGCTC GTGCAAGACA ATTTCGGCGA CACGACGGGG
CATTTGCAGC AGGCGTTGAA CAGCGATCTG GTGCAACTGA ACGCGACCTT CTCCGACGAT
CTTCTCCGGC TGCTCCGGAT CGCGGCGCAG CGCGGCTATG GCGAGAAGCT GATCGAATGG
TTCAACCAGT CCGGGCAGGC GGATCGGCGG GCGCCGGTCT ATGCGGCCCT CGTCGCTTTC
GTGCGCGGCG AACGGTTCCT GCTGGACTTC AGCCCGGAGA TCCGTAAACC GGCCGAGTCG
ATTTTCCGCT GGCTGAACTC GCGTTCGGAC AGATCCCCAT CAACTCCCGA CAAGCCCGCG
CGGAAACGTG GCAGGCCGCC GCGCAAACGC CAGACCGCAT GA
 
Protein sequence
MSNLGIAIYD PRRLDPETFL KGFVARGDFV DFLLDKLRQM PEIGEHFLIV GPRGIGKTSL 
LRRLAIGISE EPALRARFIP LSFREEQYNV RALDAFWRNC AESLAEWCED QGRQEIADDI
DRSLPSAEWH ESNSAVQAFL ALCKRLGGRP VLFVDNLDLI LDALSPQQNW ELRRTLQAPG
GPIVFGAATQ MLRQSADRDA AFYEFFHPHM LHPLSESELR HCMSRLAQAR GDLGKPVLDV
LDREPERIRT LHNLTGGNPR VLTLVYQLLE RTESDSVFRD LEVLLDQLTP YYKARVEDYQ
TDLQRAVIDA IALHWHPITS SRLSDITAVE VTTISSQLNR LKNDGLIEEV ETSGARAGYQ
LVERFFNIWY LMRHGTRRTR QKIAWLTEFL SSFYAPAELM KMKAELIAGG SASLHPLYRE
ALEAAGEESG RLARAAVPEP SVETSQLGRS SDLFREAEHI VRHWVERDQT NYDGWSLLGS
ILADHSGHPA EAEAAYRKAM TISGDRVIAE ANLAWLLFAS GRLSEAASLE SALTKLDPVG
RALLDAARAL VQDNFGDTTG HLQQALNSDL VQLNATFSDD LLRLLRIAAQ RGYGEKLIEW
FNQSGQADRR APVYAALVAF VRGERFLLDF SPEIRKPAES IFRWLNSRSD RSPSTPDKPA
RKRGRPPRKR QTA
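
As a rough consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the corresponding ends of the protein sequence. This is a minimal sketch, not the database's own method: it assumes the sequence is read in frame from its first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering of all 64 codons.
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a DNA string, ignoring the spaces between 10-base blocks."""
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and final 42 bases of the gene sequence listed above.
first_block = "ATGAGCAATC TCGGCATCGC CATCTACGAC"
last_block = "CGGAAACGTG GCAGGCCGCC GCGCAAACGC CAGACCGCAT GA"

print(translate(first_block))  # MSNLGIAIYD
print(translate(last_block))   # RKRGRPPRKRQTA*
```

The translated ends match the start (MSNLGIAIYD) and end (RKRGRPPRKR QTA) of the protein sequence, with `*` marking the terminal TGA stop codon.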