Gene RPC_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2014 
Symbol 
ID3973877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2194062 
End bp2195918 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content64% 
IMG OID637925123 
Productflagellar hook-associated protein 
Protein accessionYP_531888 
Protein GI90423518 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACGA CCGCCTTCAG CACCGCGGTG GCCGGTCTTG AGGCCGCGCA GAAAGCTATC 
CGAGTCGTCT CGCAAAACGT GGGGAATGCC GGCACGGCCG GCTACGTCAG ACGCACCGTG
ACAACGGTCG CGTCCGGACC AGGCAATTCC AGCGTTGCTG TCGGGACGGT GAATCGCGTC
TTTGAAGACG CGGCGTTGAA GCAGTTGCGA CTGGAAGCCT CGGGCGCTGC CTACAGCTCG
ACAAAGTCGA ATGTTCTCTC GCAGCTCGAC AAGCTGTTCG GCAAGCCGGG CGATGCCACG
GCGCTCGATG GCCGCATGAA CGCGTTCACA CAGTCGCTGC AAGGGCTCGC AACGGACCCA
AGCTCGGTGG CGAGCCGAAG CACGGTGCTT AACAAAGCCG CCGCTCTCAC CGAACGGATC
CGCAGCCTTG CGGGTAACGT GCAGGAGCTG CGCAGTGGCA TCGAAAATCG GCTCGGCTCG
GATGTCGGGG CCGCAAACAA TCTGTTGAAG TCGATCGCGA GCCTGAACGC CAAGCTGGCC
GCGACGTCCG ATGATTCCGC CCAGGCCGGA CTCCTCGATC AACGGGACCA GCAGATAACG
CAGCTGTCGT CCTATCTCGA CGTGAAATCG ATGCAGCAGC GCGACGGCGC CATGACGCTG
ATGACGACGT CCGGCGTCGT CTTGGTCGAT CGCGACGCAG CGGCAACGCT CTCTTTCGAC
GGCCGCGGCA ATTTGGGGCC GACGTCGGCG TATTCAACAG ATCCGTCCGC GCGTGGGGTC
GGGACCATCA CCGCGACCTT GCCTGGGGGC GCTGTCATCG ACCTTGGTGT TCGTGGCGCG
CTGAGTTCGG GCTCGATCGC AGCACAATTG GAACTGCGTG ACGTCGCTCT TCCGCTGGCC
CAGAGACGGC TCGACGATCT TGCGTTCGGT TTGGCCCAGT CACTCACCGA CAAGAGCGCG
GTCGGAACGC AGAGCGCGGC CGGAGTCGAC CTCAACCTCG CCGATCTCGC CAGTCTCAAG
CCCGGCAATA CCATCACCAT TCCGATCAGC GCGGGCGGCG CGATCCGCAA CGTGATCCTG
GTGGCGTCGT CACTTGCTTC GCGGCCGGTC GACGCAAACC AGACGATCGA CGCCAATGCG
CAAGTCCAAA CCTTCACGAT CCCGGCTGCG CCGGCGACCG CGCGGGATTA TGCGACTGCC
ATTTCGGCTG CGCTCTCGGT CGTTGCTCCG GGCCTCACCG CGGCCAGCAC CACGTCCGGC
AAGCTGACCT TGTCTGGCGC AGGCATTCAG GGCGTCGTTG CGATTAGCAC GCAGCCGAAA
TCGGCGGGCG ATGTTTCTGG GGCTCATCCT TGGATAGCCA TGTTCGTGGA CGGCTTGGGC
AATGCACTCG TCACCGGGTC GCTCGACGGC GCGCCCCAAC GCACCGGCCT AGCTCAGCGC
CTGTCGGTTA ACGCGGCGCT GATCGGCGAC TCTTCGGTTC TCGTTGCAGC CAGCAGAACG
AGTAGCTCGC CCAATCCGTC CCGTCCGCAG TTTGTTTACG ACGCGCTGAC GAGCAGCAAG
CTGGCCTTCT CATCGGTAAA CGGGATTGGC GGCACTCAAG CCCCCCATGT CGCGACGGTC
GCCGCCTTCG CTCAAGAGAT TGTGGCCGCC CAGGGCTCCG CTGCGGCGGC TGCGAAGCAA
ATCGATGACG GGCAGACGGT TGCCCTTGGC GTTGCACAAG GTCGGTTCTC GAAAAGTGCG
GGCGTCAATA TCGATGAGGA AATGTCACGC CTCATGTCGC TGCAGACCGC CTACGCCGCA
AATGCCCGTG TGCTGTCAGC CGTGCGCGAG ATGCTCGACG TTCTGATGAA GATCTGA
 
Protein sequence
MLTTAFSTAV AGLEAAQKAI RVVSQNVGNA GTAGYVRRTV TTVASGPGNS SVAVGTVNRV 
FEDAALKQLR LEASGAAYSS TKSNVLSQLD KLFGKPGDAT ALDGRMNAFT QSLQGLATDP
SSVASRSTVL NKAAALTERI RSLAGNVQEL RSGIENRLGS DVGAANNLLK SIASLNAKLA
ATSDDSAQAG LLDQRDQQIT QLSSYLDVKS MQQRDGAMTL MTTSGVVLVD RDAAATLSFD
GRGNLGPTSA YSTDPSARGV GTITATLPGG AVIDLGVRGA LSSGSIAAQL ELRDVALPLA
QRRLDDLAFG LAQSLTDKSA VGTQSAAGVD LNLADLASLK PGNTITIPIS AGGAIRNVIL
VASSLASRPV DANQTIDANA QVQTFTIPAA PATARDYATA ISAALSVVAP GLTAASTTSG
KLTLSGAGIQ GVVAISTQPK SAGDVSGAHP WIAMFVDGLG NALVTGSLDG APQRTGLAQR
LSVNAALIGD SSVLVAASRT SSSPNPSRPQ FVYDALTSSK LAFSSVNGIG GTQAPHVATV
AAFAQEIVAA QGSAAAAAKQ IDDGQTVALG VAQGRFSKSA GVNIDEEMSR LMSLQTAYAA
NARVLSAVRE MLDVLMKI