Gene RPC_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2013 
Symbol 
ID3973876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2192636 
End bp2194030 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content62% 
IMG OID637925122 
Producthypothetical protein 
Protein accessionYP_531887 
Protein GI90423517 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.918287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA CCGCCGCAAT CAACACGGCA ATCAGTGGAA TGGCCGCGCA GTCCTATGCC 
ATCGGCAACA TCTCAGGGAA CATAGCGAAC GCGCATACAC CAGGGTTCAA GCGGATCGAT
ACCAGCTTCG CGGATCTCGT CTCCAATCAG GGTCCCAAGC GTACCCTGGC GGGACCTGTC
CTTGCAGAGG CAAGGTTCAC AACGTCCGTG CAAGGCCCTC TGACCGCCAC TGGCGTCGGC
ACCAACATGG CGATCAATGG CTCGGGCTTC TTCACCGTCA TGCAGCGGGA GTCCGGCTCC
GCACAATCCG TGTACACCCG CCGCGGTGAC TTCGCGGTCG ATAAACAAGG GTATCTGGTC
AACGGAACTG GCGGTTATCT ACTCGGCAAC AATCTCGATC CCGTGTCAGG CCGAGTAACG
TCTTCCGGTC TGATCAAGAT CGCCAATACC ACCTTGCCCG GGCGGCAGAC CACGCGGATC
GACTACGCAG CCAACCTGCC GAAGGTGCCG ACGACGACGG CTTCGGTCGG CGGCGATTCA
TCGCCTTACA CGGTCGCGAG CGCTGTTGTC ATTGACCCGA CACTGACAGC ACCGGACCTC
ACCAAGAAGG TGGTCGGGGC AGCGCAGGTG CCGGCCTTCC TCGATAAGTC GCTGATGGGC
CCGTCGCTGA CGATGTATAT GGCCGGAGGA AGCCCGGCGT CGCTGTCGAC GCGCTGGGCC
AAGGTGCAGG ACGCCGCAGC CACGGCGACG CCGCCGAAGA ACGCGATCTG GAATCTGTTC
TATGCGTCGA GCTCGACTGC CGCCAAGGAC AGCGACTGGG TCAATGTCGG ATCGGCCTTC
TCGTTCGACT CCGCCGGAAA GCTCGTACCG CCGGCCCATG CAACCGTATC AAGCAGCGGA
GCCGTCTCTC TCAAGATCCC GCAAGTCGCC GTCGATGGTG TGAGCGTCGG CGACATCACG
ATGAATCTCG GCAGCAGCGG GCTCACTCAA AATGCCGCCT CCGCCGGCAC CGTGACGACC
AACGCGCTGG CACAGGACGG CTACGCGTCG GGAAGCCTGA AAAGCCTTTC CGTCAGCGGG
GATGGCACGA TCGTCGGCTC GTTCTCCAAT GAGATGACAG CCTCCGTCGC AACGGCAGGC
GTCGTCAACT TCATGAATCC CAATGGCCTG CGACCGACGT CCGGCGGCAA CTACGAGCAG
TCTCGCGATT CCGGCGCGCC GCTCGCGGGC CTCAACGGTG GAACCATCGT TGGAGGCAAC
ATCGAGGGCT CCAATTCAGA CGTCGCGGGG CAGTTCTCGA AACTGGTGGC AACGCAGCAG
GCTTACTCCG CGAACGTGAA GGTCATGACG ACCGCGAACC AGATGATGGC GGATCTGCTC
AATGCAGTTC GCTGA
 
Protein sequence
MSITAAINTA ISGMAAQSYA IGNISGNIAN AHTPGFKRID TSFADLVSNQ GPKRTLAGPV 
LAEARFTTSV QGPLTATGVG TNMAINGSGF FTVMQRESGS AQSVYTRRGD FAVDKQGYLV
NGTGGYLLGN NLDPVSGRVT SSGLIKIANT TLPGRQTTRI DYAANLPKVP TTTASVGGDS
SPYTVASAVV IDPTLTAPDL TKKVVGAAQV PAFLDKSLMG PSLTMYMAGG SPASLSTRWA
KVQDAAATAT PPKNAIWNLF YASSSTAAKD SDWVNVGSAF SFDSAGKLVP PAHATVSSSG
AVSLKIPQVA VDGVSVGDIT MNLGSSGLTQ NAASAGTVTT NALAQDGYAS GSLKSLSVSG
DGTIVGSFSN EMTASVATAG VVNFMNPNGL RPTSGGNYEQ SRDSGAPLAG LNGGTIVGGN
IEGSNSDVAG QFSKLVATQQ AYSANVKVMT TANQMMADLL NAVR