Gene RPB_3817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3817 
Symbol 
ID3911620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4358588 
End bp4360384 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content65% 
IMG OID637885718 
Producthypothetical protein 
Protein accessionYP_487422 
Protein GI86750926 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCT TCGGCGCTCT CACCACCTCG GTCGCGGGCC TGCGTGCGAA CTCCTATGCG 
CTCGAGAACA TCTCGGGCAA CATCGCCAAT TCGCAGACCA CCGCCTTCAA GCGCGTCGAC
ACCTCGTTTC TCGACCTGAT CCCGCAGGCC GGCACCAACC AGCAGCTCGC GGGCTCCGTG
ACCGCGGAAT CGCGGCTGAC CAACACGCTG TCCGGCTCGG TGCAGTCGGC GTCGGTGTCG
ACCTACATGG CGATCAACGG CGAGGGCTTC TTCGCCGTCC AGAAGCCGGG CTCCTTCTCC
GACAACAGCC CGGTGTTCAC CGGCGTCAAC AACTACACCC GCCGCGGCGA CTTCTCGCTC
GACAAGAACG GCTATCTGGT CAACGGCGCC GGCTATTATC TGCAGGGCGT GGCGATCGAT
CCGACCACCG GCAACCCGGT CGGCAGCACG CCGACGGTGC TGAAGTTCCA GAACGACTTT
CTGCCGTCGC AGGAGACGAC CAAGATCAAC TATCGCGCCA ATCTGGCGCG CTATCCGCTC
ACCACGAAGC AGGACACCTC GATCCCCGGA TCGGAACTGC TGCGCGCCGC CGACTTCGCC
AACAACCCGC AGGTCAGCGG CACCACGCCG CCGCCGTTCG GCGACAATTC CAGGGCCGGC
CTGCAGATCA ACGCCAAGGA CGCCACCCCG ATCACCGGCG CCACCACGCT GAGCGGCGCC
GCCAGCACGG ATTCGATCGG CGTCAACTTC GCGGTCGGCG ACACCATCGT CGTCAACGGC
ACCACCATCA CCTTCACCGC GTCCGGCGGC ACCGACGACG CCACCAATAT TCCGGTCGAC
TCGACGATCA CCAATCTGCT GAGCAAGATC GACGCGATTT CCGGCGGCGG CGCGGCGTCG
ACGGTGACGT CGGGCGCGCT CGAATTGCAC ACCGGCACGG CCAGCGATCT GACCATCACC
TCCACCAGCG CAGCCTTCGC CTCGCTCGGC CTGACCAGCC CGTTCAGCGT GATCCGCACC
GGCGGCGGCA CGGTCGGCAC CGGCCAGGTG ATCGGGACGG ACAACCAGAC TTTCCTCGAC
GAGTCGATCT CGGGTGGCGC CACCACCGCC TATGACGGCT CCGGCGCTCC GGTGAACGTG
CAGTTCCGCT GGGCGAAGAT CGACTCCGCG ACGCTCGGCA CCGGCCATTC CGACGTCTGG
AACCTGTTCT ATCAGGTCAA TCCCGACGCC ACCGGCACCG CGGTCGCGTG GCAGAACGTC
AATACCAACT TCACTTTCAA CTCCAGCGGC CAGATGAACC CGGTGATCGG CCAGCTCACG
CTGACCAATC TGACGGTGTC CGGCGTGTCG CTCGGCAACG TGACGATGTC GTTCGGCACC
GGCGGCCTGA CGCAGTTCTC CGACACCAAC GGCAACGTCC AGGTCAACCA GCTGCAGCAG
GACGGCTACG CCGCCGGCCA GCTCACCAGC GTCTCGGTGA GCAACGAGGG CCGCGTCGTC
GGCTCCTACT CCAACGGCCG CAACATCGAT CTCGCCGAAG TCTCGGTCGC CACTTTCAAC
GGCGCCAACT TCCTCAAGCG CATCGACGGC GGCGCGTTCG AGGTGACCAA CGAATCCGGC
GAGGCGCTGT ACGGCAAGGG CGGCAGCATC TCGGGTTCGT CGCTGGAATC GTCGAACACC
GATATCGCCG ACGAATTCAC CAAGCTGATC GTCACCCAGC AGGCCTATTC GGCCAACACC
AAGGTGATCA CCACGGCCAA CACCATGGTG CAGGACCTGC TCAACGTGAT GCGCTGA
 
Protein sequence
MGIFGALTTS VAGLRANSYA LENISGNIAN SQTTAFKRVD TSFLDLIPQA GTNQQLAGSV 
TAESRLTNTL SGSVQSASVS TYMAINGEGF FAVQKPGSFS DNSPVFTGVN NYTRRGDFSL
DKNGYLVNGA GYYLQGVAID PTTGNPVGST PTVLKFQNDF LPSQETTKIN YRANLARYPL
TTKQDTSIPG SELLRAADFA NNPQVSGTTP PPFGDNSRAG LQINAKDATP ITGATTLSGA
ASTDSIGVNF AVGDTIVVNG TTITFTASGG TDDATNIPVD STITNLLSKI DAISGGGAAS
TVTSGALELH TGTASDLTIT STSAAFASLG LTSPFSVIRT GGGTVGTGQV IGTDNQTFLD
ESISGGATTA YDGSGAPVNV QFRWAKIDSA TLGTGHSDVW NLFYQVNPDA TGTAVAWQNV
NTNFTFNSSG QMNPVIGQLT LTNLTVSGVS LGNVTMSFGT GGLTQFSDTN GNVQVNQLQQ
DGYAAGQLTS VSVSNEGRVV GSYSNGRNID LAEVSVATFN GANFLKRIDG GAFEVTNESG
EALYGKGGSI SGSSLESSNT DIADEFTKLI VTQQAYSANT KVITTANTMV QDLLNVMR