Gene Rpal_4452 details

Gene Information

Locus tag: Rpal_4452
Symbol:
ID: 6412136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4781174
End bp: 4783840
Gene Length: 2667 bp
Protein Length: 888 aa
Translation table: 11
GC content: 65%
IMG OID: 642714334
Product: flagellin
Protein accession: YP_001993423
Protein GI: 192292818
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
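
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4781174
end_bp = 4783840

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 2667 888
```

Under these assumptions the result agrees with the listed gene length of 2667 bp and protein length of 888 aa.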


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.438814
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGTA TCGTTCTATC CAACGCCGTT CGCCAGAATC TTTCTTCGCT CCAGGCCACG 
GCTGACTTGC TCGCCACCAC CCAAAGCCGC CTCTCGTCCG GCAAGAAGGT GAACACGGCG
CTCGATAATC CGACTAACTT CTTCACCGCC GCTTCGCTCG ACAGCCGCGC CAGCGACATC
AACAACCTCC TCGATGGCAT CGGCAACGGC GTGCAGATCC TGCAGGCCGC CAATACCGGC
ATCACCTCGC TGAACAAGCT GGTGGACAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG
ACGACCTCCG GCTACGCCAC CAAGTCGAAC GTGTCGGCCA CCATCTCCGG CGCCACCGCT
GACGACCTGC GCGGCACCCA GAGCTACTCG AACGCGGTTG CCACCGGCAA CGTGATCTTC
GACGGCACCG CGGGTGGCAG CACCGCTGCG TCCGGCACCG ACACCCTCGG TGGCGCGATC
GTCAGCATCG CGGCGGGTAC GGCTGTGACC GCTCTCGGCG CCGCTGACAA CACCGCGCTC
GGCAGCGTTC TCAGCGTCGG CACCGCCGCC GCCACCGCGG GCGGCTCCAA CCTGATCAGC
GATCTCACCA ACGGTTCGAC CACCACGGCG ACCGGTCCGG CTGCGGGCGA CTCGATCACG
GTGAACGGCA AGACCATCAC CTTCACGACT GCCGGTGCCG CCAGCAAGGA CAGCAACGGC
AACTACACGA TCGGTCTCGA CCAGACCCTG ACCAAGCTGG CCAACACGAT CGACGATATC
AACGGCAACA CCGGCCATTC GTCGACCATC ACCGCCGGCA AGCTGGAACT GCACTCGGGC
ACCAACAGCC CGCTGACGAT CGGCGACAAC GCCGGCGGCG CCGTGCTGGC CAAGCTCGGC
CTGACCGCGC AGACCGTCGA CACCGCGGCT GCGACCGCCT CGGCCAACAT CTCGGCCACG
ACGCAGCTGT TCAACACCCA TGGTGGCCTC ACCACCGCGG CGATCGCGGA CGGCACCCAG
CTGACGGTCA ACGGCAAGAC CATTACGTTC AAGACCTCCG ACGCTCCGCA GGGCAATAAC
ATCCCGACGG GCACCGGTGT TCTCGGCCGT ATCGGCACCG ACGGCAACGG CAATTCGACG
ATCTATCTCG GCGACCAGAC CAAGTTCAGC AACGCGACCG TTGGTGACCT GCTGACCGCG
ATCGATCTGG CCAACGGCGT CAAGTCGGCG ACCATCTCGT CGGGTGTCGC AACGATCAGC
ACCAACTCCG GCCAGACTGC TTCGGACGTC ACCGGTGGTA TCACCACCAT CCGCAGCTCG
ACCGGCGCCG ACCTCAACGT CACCGGCTGG ACCGACCTGT TCAAGAACCT CGGTCTGACC
AGCGCTACCG GTACCGGTCC GCTGACCCTC ACCAAGCAGC GCACCACCAG CGGCACCACG
ATGGGCACGC TGATCGCGGA CGGCTCCACG CTGAACGTGA ACGGCAAGAC CATCACCTTC
AAGAACGCCG CTGTTCCGAC TGCGTCGTCG ACCCACCAGG GCATCTCCGG CAACGTCGAG
ACCGATGGCC AGGGCAATTC GACCGTGTAC CTGCAGAAGG GCACCATCGA TGACGTCCTG
AAGGCCATCG ACCTCGCCAC CGGCGTCCGC ACGGCTTCGC TGGGTGTCTC CGGTGCCACG
ATCTCGACTG CCAACGGCAC GGCCAACTCG TCGATCACGA GCGGCTCGCT GAAGCTGTCG
ACCGGACTTG CCTCCGACCT TAGCATCACC GGCACCGGCA ACGCGCTGGC TGCCCTCGGC
CTCACCGGCC CGAGCGGCAT CTCCACCTCG TTCACCTCGG CTCGTGGCGC TTCGGCCGGC
AGCCTGAACG GCAAGACGCT GACCTTCACC TCCTTCAACG GCGGTGCGGC CACCAACGTC
ACCTTCGGCG ACGGCACCAA CGGCACCGTC AAGTCGCTCG CTCAGCTGAA CGCTGCGCTG
GCGTCCAACA ACCTGACGGC GTCGATCGAC AACGCCTCCG GCAAGCTCAC GATCGCAGCG
TCGAACGACT ACGCCTCCCA CACTCTGGGT GGCTCGGACG GCGGTGTGAT CGGCGGTACC
CTGGCTTCGA CCCTGACCTT CTCGGTGCCG AACGCGCCGG TGGTCGACGT CAACGCCCAG
ACCACCCGCG CCGGCCTGGT CAAGCAGTTC AACGACGTGC TCGACCAGAT CAAGACCACG
GCTCAGGACG CTTCGTTCAA CGGTGTGAAC CTGCTGAACG GCGACACCCT GAAGCTGGTG
TTCAACGAAA CCGGCAAGTC GACGATCTCG ATCCAGGGCG TCACCTTCAA CCCGACCGGC
CTTGGCCTGT CGAACCTGAG CTCTGGCGTC GACTTCATCG ACAACAACGC CACCAACGCC
GTGCTGAGCA AGCTGAGCAC CGCTTCGACC GCCCTGCGGT CGCAGGCCTC CGCGTTCGGT
TCGAACCTGT CGATCGTGCA GGCCCGTCAG GACTTCTCGA AGAGCCTGAT CAACGTGCTG
CAGACCGGTT CGTCGAACCT CACGCTGGCC GACACCAACG AGGAAGCGGC GAACAGCCAG
GCGCTGACGA CCCGCCAGTC GATCGCGGTG TCCGCGCTGT CGCTGGCCAA CCAGTCTCAG
CAGGGCGTGC TGCAGCTCCT CCGCTAA
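
The GC content field is a percentage of G and C bases in the gene sequence. As a rough sketch of how such a figure could be computed from sequence text laid out in blocks of ten, a small helper might look like the one below. The function name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Only the first line of the gene sequence is used here, so the result is
# not expected to equal the figure reported for the whole gene.
first_line = "ATGTCCGGTA TCGTTCTATC CAACGCCGTT CGCCAGAATC TTTCTTCGCT CCAGGCCACG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as a single string would give the whole-gene value for comparison with the GC content field.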
 
Protein sequence
MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNTA LDNPTNFFTA ASLDSRASDI 
NNLLDGIGNG VQILQAANTG ITSLNKLVDS AKSIANQALQ TTSGYATKSN VSATISGATA
DDLRGTQSYS NAVATGNVIF DGTAGGSTAA SGTDTLGGAI VSIAAGTAVT ALGAADNTAL
GSVLSVGTAA ATAGGSNLIS DLTNGSTTTA TGPAAGDSIT VNGKTITFTT AGAASKDSNG
NYTIGLDQTL TKLANTIDDI NGNTGHSSTI TAGKLELHSG TNSPLTIGDN AGGAVLAKLG
LTAQTVDTAA ATASANISAT TQLFNTHGGL TTAAIADGTQ LTVNGKTITF KTSDAPQGNN
IPTGTGVLGR IGTDGNGNST IYLGDQTKFS NATVGDLLTA IDLANGVKSA TISSGVATIS
TNSGQTASDV TGGITTIRSS TGADLNVTGW TDLFKNLGLT SATGTGPLTL TKQRTTSGTT
MGTLIADGST LNVNGKTITF KNAAVPTASS THQGISGNVE TDGQGNSTVY LQKGTIDDVL
KAIDLATGVR TASLGVSGAT ISTANGTANS SITSGSLKLS TGLASDLSIT GTGNALAALG
LTGPSGISTS FTSARGASAG SLNGKTLTFT SFNGGAATNV TFGDGTNGTV KSLAQLNAAL
ASNNLTASID NASGKLTIAA SNDYASHTLG GSDGGVIGGT LASTLTFSVP NAPVVDVNAQ
TTRAGLVKQF NDVLDQIKTT AQDASFNGVN LLNGDTLKLV FNETGKSTIS IQGVTFNPTG
LGLSNLSSGV DFIDNNATNA VLSKLSTAST ALRSQASAFG SNLSIVQARQ DFSKSLINVL
QTGSSNLTLA DTNEEAANSQ ALTTRQSIAV SALSLANQSQ QGVLQLLR
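
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is an illustration, not the database's own method. It builds the codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The gene starts with ATG, so no alternative start handling is needed here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base, then second, then third, each over TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence. Each full line holds 60 bases,
# so every line starts in frame.
first_line = "ATGTCCGGTA TCGTTCTATC CAACGCCGTT CGCCAGAATC TTTCTTCGCT CCAGGCCACG"
last_line = "CAGGGCGTGC TGCAGCTCCT CCGCTAA"
print(translate(first_line))  # MSGIVLSNAVRQNLSSLQAT
print(translate(last_line))   # QGVLQLLR
```

The two outputs match the first 20 and last 8 residues of the protein sequence, and the final TAA codon is the stop.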