Gene Rpal_4438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4438 
Symbol 
ID6412122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4762902 
End bp4765562 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content66% 
IMG OID642714320 
Productflagellin 
Protein accessionYP_001993409 
Protein GI192292804 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATG TGGTTCTCTC GGCAGCGGTT CGCCAGAATC TGCTATCGCT GCAGTCGACT 
GCCGATCTGT TGGCGACGAC GCAGAACCGT CTTGCTTCCG GCAAGAAGGT CAACACCGCG
CTCGACAATC CGACCAACTT CTTCACCGCA GCAGGGCTCG ACAGCCGCGC CAGCGACATC
AACAACCTGC TTGACGGCAT CGGCAACGGC GTCCAGATCC TTCAGGCGGC CAACACCGGC
ATCACCTCGC TGAACAAGCT GATCGACACC GCCAAATCGA TCGCCAACCA GGCGCTGCAA
TCGAACGTCG GCTACTCGAC CAAATCCAAC GTTGCGGCGA CCATCGCGGG CGCCACGCCG
AACGATCTGC GCGGCACTCA GACCTTCGCC AGCGCGATCG CGACCAGCAA CGTGATCTAT
GACGGCACCG CCGGCGGCAC CAGCGGCGTG TCGTTGACCG ACACGCTGGG CGGCGGCATC
GGCTCGCTGG TCGGCACCGC GGTCACCAAG AACGTGCCGA CCGACGCGAC CTCGACCGGC
TCAGTGCTCT ATACCGGCAC CGCGACGGCG ACTGCGACCG GGACCGACCT GATCTCGTCG
CTGACCAATG GTTCGACCAC GACCCCGACC GGCCCGCAGG CCGGCGACGT GCTGGTGGTC
GATGGCAAGA CCATCACCTT CACCGCCACC GGCACCCCGA CCATCGACAG CAATGGCAAC
TATACGATCG GCGTCAACCA GCCGATCAAC GCACTGCTCG CCACGGTCGA CACCATCAAC
GGCAACACCA GCAACCCGTC GGTGGTCAAC TCCAGCGGCC ACCTTGAGTT GCACACCGGC
ACCAACCGCG CGCTGTCGGT CTCCGACACC TCGGGCGGCA CCGTGCTGGC CAAGCTTGGC
TTCAGCGCCC CGGTGTCGAC GTCGCTCGGC ACCGGCGCCT CGGCGCCGAT CACCGCCAGC
ACCAAGCTGT TCAACACCGT AGGCGGCCTG GGTGCGGCGA TCGCCGACGG CACGACGCTG
ACGGTGAACG GCAAGACCAT CACCTTCAAG GCGTCCGATC CGCCGTCGCC GGCCGGCCTG
TTGGTCGGTT CGGGCGTGCT CGGCAACATT GTTGCCGACT CGCAGGGCAA CTCGACGATC
TATCTCGGCA CCAGCAATAC CTTTACCAAT GCCACCGTCG GCGACGTGCT GACGTCGATC
GATCTCGCCA GCGGCGTGAA GTCGGCGACG ATCTCGGGCG GCATTGCGAC CTTTACGGCC
AACGGCACGG CGTCTTCGAT CAGCGGCGCC GGCGTGGTGA CGTTGAACAC TTCGACCGGC
GCCGATCTCC ACGTCACCGG TCCTGCCGAC TTCCTGAAGT CGCTGAACCT CACCACCTCG
ACCGGCTCCG GCCCGACCAC GCTCGACGTT CCGCGCATCA CCGGTGCAAA CACGATCGGC
ACGCTGATCG ACGACGGCTC GATCCTCACC GTCAACGGCA AGACTATCAC CTTCAAGAAC
GCCCCGGTGC CGCTCGCCTC GGCGACCCAC ACCGGCGTGA GTGGTCATGT CGAGACTGAC
GGTATCGGCA ATTCGACGGT CTATCTGCAG GGCGGCACCG TCGCCGACGT GCTGAAGGCG
ATCGACCTTG CCACCGGCGT GCAGACCGCG ACGCTGTCCC AGACCGGTGC GACGCTGACG
ACCCAGCTCG GCTCCGCCAA TTCGTCGCTG TCGAGCGGTT CGCTGAAGAT CTCGACCGGC
AGCGCTTCCG ACCTCACCAT CAGCGGCACC GGCAATGCGA TGCTGGCGCT CGGCCTCGCC
GGCAACACCG GCACCTCGAC GGTGTTTCAG GCGTCGCGCG CTTCGGGCGC CGGTGGTGTC
TCCGGCAAGA CCATGACCTT CACGTCGTTC AAGGGCGGCT CGCCGGTCAG CGTCACCTTC
GGCGACGGCA CCGGCGGCAC GGTGAAGACG CTGGCGCAGC TCAACGCCGC GCTCGCGCCG
AACAACCTGA CCGCGCAGGT CGATGCCAAT GGCGTGCTGA TGATCTCGGC GAGCAACGAC
TACGCGTCGT CGACGCTCGG CTCGGTCGCG GACGGCGGCG TGCTTGGTGG CTCGATCACC
TCGACGCTGA CCTTCACCAC GCCGAACCCG CCGGTCGCCG ACCTCAATTC GCAGGCAGCG
CGCGCCAAGC TGGTCGAGCA GTACAACAAC GTCATCGCGC AGATCACCAC CACGTCGCAG
GACGCTTCGT TCAACGGCGT GAACCTGCTG AACGGCGACA CGCTGAAGCT GGTGTTCAAC
GAAACCGGCA AGTCGACGCT GACCATCGTC GGCACGGCGC TGACGCCAGC GGCACTCGGC
CTGCCGACGC TGGTGGCAGG CACGGACTTC ATCGACAACG CAGCCACCAA CAAGACGCTG
GCTGCGCTCA ACACCGCATC GACCACGCTG CGGTCGCAGG CGTCGTCGTA CGGTTCCAAT
CTGTCGATCG TGCAGATCCG TCAGAACTTC GCCAAGAGCC TGATCAACGT GCTGCAGACC
GGCTCTGCGA ACCTGACGCT CGCCGATACC AACGAAGAAG CGGCCAACAG TCAGGCGCTG
TCGACGCGGC AATCGATCGC GGTGTCGGCG CTGTCGCTGG CCAACCAGTC TCAGCAGAGC
GTCCTGCAGC TGCTGCGTTA A
 
Protein sequence
MADVVLSAAV RQNLLSLQST ADLLATTQNR LASGKKVNTA LDNPTNFFTA AGLDSRASDI 
NNLLDGIGNG VQILQAANTG ITSLNKLIDT AKSIANQALQ SNVGYSTKSN VAATIAGATP
NDLRGTQTFA SAIATSNVIY DGTAGGTSGV SLTDTLGGGI GSLVGTAVTK NVPTDATSTG
SVLYTGTATA TATGTDLISS LTNGSTTTPT GPQAGDVLVV DGKTITFTAT GTPTIDSNGN
YTIGVNQPIN ALLATVDTIN GNTSNPSVVN SSGHLELHTG TNRALSVSDT SGGTVLAKLG
FSAPVSTSLG TGASAPITAS TKLFNTVGGL GAAIADGTTL TVNGKTITFK ASDPPSPAGL
LVGSGVLGNI VADSQGNSTI YLGTSNTFTN ATVGDVLTSI DLASGVKSAT ISGGIATFTA
NGTASSISGA GVVTLNTSTG ADLHVTGPAD FLKSLNLTTS TGSGPTTLDV PRITGANTIG
TLIDDGSILT VNGKTITFKN APVPLASATH TGVSGHVETD GIGNSTVYLQ GGTVADVLKA
IDLATGVQTA TLSQTGATLT TQLGSANSSL SSGSLKISTG SASDLTISGT GNAMLALGLA
GNTGTSTVFQ ASRASGAGGV SGKTMTFTSF KGGSPVSVTF GDGTGGTVKT LAQLNAALAP
NNLTAQVDAN GVLMISASND YASSTLGSVA DGGVLGGSIT STLTFTTPNP PVADLNSQAA
RAKLVEQYNN VIAQITTTSQ DASFNGVNLL NGDTLKLVFN ETGKSTLTIV GTALTPAALG
LPTLVAGTDF IDNAATNKTL AALNTASTTL RSQASSYGSN LSIVQIRQNF AKSLINVLQT
GSANLTLADT NEEAANSQAL STRQSIAVSA LSLANQSQQS VLQLLR