Gene RPD_2370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2370 
Symbol 
ID4022859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2645785 
End bp2647998 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content64% 
IMG OID637962563 
Productflagellin 
Protein accessionYP_569503 
Protein GI91976844 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGTA TCGTTCTCTC CAATGCGGTC CGCCAGAATC TTTCCTCGCT GCAGGCCACC 
GCGGACTTGC TCGCCACCAC CCAGAGCCGG CTTTCGTCGG GCAAGAAGGT GAACTCGGCG
CTCGATAATC CCACCAACTT CTTCACCGCA GCTTCGCTCG ATTCGCGCTC CAGCGACATC
AACAACCTGC TCGACGGCAT CGGCAACGGC ATTCAGATCA TTCAGGCCGC CAACACCGGC
ATCAGCTCGC TGACCAAGCT GGTGGACAGC GCCAAGTCGA TCGCCAACCA GGCGCTGCAG
TCGGTCGCCG GCTACAGCAG CAAATCGAGC GTCACGACCA CGATCGCCGG CGCCACCGCG
GACGACCTGC GCGGCACCTC GACCTATTCC AACGGCCTCG CGCAAAGCAT CGGTCTGCAG
GACGGCCAGG GCACGCCCGG CGTTGTCGAT GGTGATACCC TGCTCGGCGG CGTCGCTGCG
ACCAAGACCG GCGGAACCGT CGGTGGTAGT GGCATCACCG CAGGCACCGC GCTGAGCGCG
CTGGGCGCGA ACAAGCCGGT GGCCGGCGAC ACCATGACGG TGAACGGTCG CACCATCACC
TTCGCAAGCG GCGGCGCTCC GGACAAGGCT ACCCTGCCGA CCGGCTCGGG TGTCGAAGGT
CAGCTCGTCA CAGACGGCAA GGGCAACTCC ACCGTCTTTC TGGACAGCGG TACTGTTCAG
GACGTGATGA ACGCGATCGA CCTCGCCAGC GGCGTTCAGA AAGTGACGAT CACCGGTGGC
GACGCTACGC TGGCGCCCAG TTCTGGTACT GCCGCTGCGG TCACGTCGAA CGCGCTTGTG
CTGTCGACCT CGACGGGTTC GGATCTGTCG ATCTCCGGCA ACAACACGCT GTTGTCGGCC
TTCGGCTTGA ATTCGGGCGC CACCGGCGCC GGTACCTTCA AGGCCGAACG CACTGCCAGC
CCTGCTGCAG GCGACGGCGT CAGCCGCGCC AACATGATCC AGGCCGACTC GACGCTCAGC
ATCAACGGCA AGACCATCAC CTTCAAGGAT GCCGCGATCC CTGCAAACGC TGACTATGGC
TTCGGCAAGG TCGGCAGCCA GAACGTCATC ACCGACGGCA ACGGCAACTC CACTGTCTAT
CTGCAGGGCG GCACGATCAA GGACGTGCTC ACCGCGGTCG ACATCGCGAG CGGCGCGCAG
ACCGCGCCGG TCAGCAACGG CGCAGCCTCC CTCGCTGTGA CAGCGGGCAG CGAAGCCTCC
AAGGTGCTCA GTGGCGGTCA GTTGCAGATC AGCTCCGGTC TGGCCGGCGA TCTGAAGATC
AGCGGCACCG GCAATGCGCT GTCGGCGCTG GGCCTCGCCG GCAATCAGGG AACCGCGACC
AGCTTCTCGG TCGCGCGGAC CGCCACTGCC GGCGGAATCA CCGGCAAGAC GTTGTCGTTC
GAAGCCTTTA ATGGCGGCAC CGCGGTCAAC GTGACCATCG GCGACGGCAC CAACGGCACC
GTGAAGTCGC TGGCCGACTT GAACTCGGCG CTGTCGGTCA ACAATCTGGC GGCGTCGATC
GACACCACCG GCAAGCTGAC CATCTCGGCG TCCAACGACT ATGCCTCCTC GACGATCGGC
TCGACCGAAT CGGGCGGCAA GATCGGCGGC ACCGCGGCGT CGCTGTTCTC GACGGCTTCG
GCTCCTGTTG CCGACGTCAA CGCCCAGAAC ACCCGCGCCA ATCTGGTGAC GCAGTACAAC
AACATCATTC AGCAGATCAA AACCACTGCT CAGGATGCGT CGTTCAACGG CGTCAACCTG
CTCGGCGGCG ACACGCTGAA GCTGGTGTTC AACGAAACCG GCAAGTCCAC CCTGAGCATT
CAGGGCGTCA CCTTCGACCC GGCCGGCCTC GGCCTGTCGA GCCTGAAGTC GGGCAAGGAC
TTCATCGACA ATGCGAACAC CAACAAGGTG CTGTCGTCTC TGAACACCGC GTCGAGCACG
CTGCGTTCGC AGGCCTCGGC GTTGGGCTCG AACCTGTCGA TCGTGCAGAC CCGTCAGGAC
TTCTCGAAGA ACCTGATCAA CGTGCTGCAG ACCGGCTCGT CCAACCTGAC GCTGGCCGAC
ACCAACGAGG AAGCGGCCAA CAGCCAGGCG CTGTCGACCC GCCAGTCGAT CGCGGTGTCC
GCGCTGTCGC TGGCCAACTC GTCGCAGCAG AGCGTGCTGC AGCTACTGCG TTAA
 
Protein sequence
MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA ASLDSRSSDI 
NNLLDGIGNG IQIIQAANTG ISSLTKLVDS AKSIANQALQ SVAGYSSKSS VTTTIAGATA
DDLRGTSTYS NGLAQSIGLQ DGQGTPGVVD GDTLLGGVAA TKTGGTVGGS GITAGTALSA
LGANKPVAGD TMTVNGRTIT FASGGAPDKA TLPTGSGVEG QLVTDGKGNS TVFLDSGTVQ
DVMNAIDLAS GVQKVTITGG DATLAPSSGT AAAVTSNALV LSTSTGSDLS ISGNNTLLSA
FGLNSGATGA GTFKAERTAS PAAGDGVSRA NMIQADSTLS INGKTITFKD AAIPANADYG
FGKVGSQNVI TDGNGNSTVY LQGGTIKDVL TAVDIASGAQ TAPVSNGAAS LAVTAGSEAS
KVLSGGQLQI SSGLAGDLKI SGTGNALSAL GLAGNQGTAT SFSVARTATA GGITGKTLSF
EAFNGGTAVN VTIGDGTNGT VKSLADLNSA LSVNNLAASI DTTGKLTISA SNDYASSTIG
STESGGKIGG TAASLFSTAS APVADVNAQN TRANLVTQYN NIIQQIKTTA QDASFNGVNL
LGGDTLKLVF NETGKSTLSI QGVTFDPAGL GLSSLKSGKD FIDNANTNKV LSSLNTASST
LRSQASALGS NLSIVQTRQD FSKNLINVLQ TGSSNLTLAD TNEEAANSQA LSTRQSIAVS
ALSLANSSQQ SVLQLLR