Gene Information |
Locus tag | RPB_3800 |
Symbol | |
ID | 3911603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4333358 |
End bp | 4336018 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885701 |
Product | flagellin |
Protein accession | YP_487405 |
Protein GI | 86750909 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
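The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4333358, 4336018)
    print(length)                  # 2661, matching "Gene Length"
    print(protein_length(length))  # 886, matching "Protein Length"
```

Under these assumptions the record is internally consistent: 2661 bp correspond to 887 codons, of which the last is the stop codon.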
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.139024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATG TGGTTCTTTC AGCAGCAGTT CGCCAGAATC TGCTTTCGCT GCAATCGACG GCGGATCTGC TCTCGACCAC CCAGAATCGT CTTGCGAGCG GCAAGAAGGT CAACACCGCG CTCGACAATC CGACCAACTT CTTTACCGCC GCGGGGCTCG ACAGCCGCGC CAGCGACATC AACAACCTGC TCGACGGCAT CAGCAACGGC GTCCAGATCC TGCAGGCGGC CAACACCGGC ATCACCTCGC TGAACAAGCT GATCGACACC GCCAAGTCGA TCGCCAACCA GGCGCTGCAG TCGAACGTCG GCTACTCCAC GAAATCCAAC GTCTCGGCGA CCATCGCGGG CGCCACCCCG GACGATCTGC GCGGCACCCA GACCTTCGCC AGCGCGACCG CCACCAGCAA CGTGGTCTAT GATGGCACCG CCGGCGGCAC CAACGGCGTG TCGCTGACGG ATACGCTCGG CGGCGGCGTC GGCAGCATCA CCGGCACCAA CATCACCAAG GCGGTCGCGG CCGACGCGAC GGCCACCGGC GGCGTACTCT ACACCGGCAC GGCGACGGCG ACGGCGACCA GCGCCGACCT GATCAGTTCG CTGACCAACG GCTCGACCGT GACGCCGACC GGCCCGCAGG CCGGCGACAT CATCGTGGTC AACGGCAAGA ACATCACCTT CACCACCACG GGCTCGGCGA CCGCAGACAG CAACGGCAAC TATACGATCG GCATCAACCA GCCGATCAGC GCGCTGCTGG CGAGCATCGA CACCATCAAC GGCAACACCA GCAACCCGTC GGTCGTCGAC GCCAACGGCC ATATCCAGCT CCACACCGGC ACCAACCGTT CGCTGTCGAT CAGCGACACC AGCAGCGGCA CGGTGCTGGC GAAGCTCGGC TTCGGTTCGA CGGTCACGGT CCCGCTCGGC ACCGGCGCCG CCACGGCGAT CACCGCGACC ACGAAGCTGT TCAATTCGGT CGGCGGCCTC GGACCGGCGA TCGCCGACGG CACCACGCTG ACGGTCAACG GCAAGTCCGT CACCTTCAAG GCGAGCGATC CGCCGAGCGC CGCGGGCCTG CTCGCGGGCT CCGGCGTGCT CGGCAATATC GTCACGGATA CGGCCGGCAA CTCCACCATC TATATGGGGA CGAGCAACAC CTACACCTCG GCCACTGTCG GCGACGTGCT CACCGCGATC GATCTCGCCA GCGGCGTCAA GTCGGCGACG ATCGCCAACG GCATCGCGAC CTTCGCGGCC AACGGCACGC CGTCGCAGAT CTCCGCCGGC GGCGCGGTGA CGCTGCAGAC TTCGACCGGC GCCGATCTCA GCATCACCGG CCCGGCCGAC TTCCTGAGCT CGCTGAACCT GACCGCGTCG ACCGGCCCGG GCCCGGCGAC GCTCACCGCC ACCCGTTCGA CCGGCGCCGG CACCATCGGC ACGCTGATCG AGGACGGCTC GACGCTGAAC GTCAACGGCA ACATCATCAC CTTCAAGAAC GCCCCGGTGC CGCTCGCCTC GGCCAGCCAC ACCGGCATCA GCGGCCATGT CGAGACCGAC GGTCTCGGCA ATTCGACCGT GTATCTGCAG GGCGGCACCA TGGCCGACGT GCTGAAGGCG ATCGACCTCG CCACCGGCGT GCAGACGGCG ACGCTGTCGC AGACCGGCGC GACGCTGACG ACGCAGACCG GCTCGGCCAA CTCGTCGCTG TCCAGCGGCT CGCTGAAGGT CTCGACCGGC AGCGCCTCCG ACCTCACCAT CAGCGGCACC GGCAACGCGA TGCTGGCGCT GGGCCTCGCC 
GGCAACACCG GCACCTCGAC CGAGTTCAAG GCGTCGCGCT CGTCCGGCAC CGGCGGCGTC AGCGGCAAGA CGCTGAGCTT CACCTCGTTC AAGGGCGGCA CCCCGGTCAG CGTCACCTTC GGCGACGGCA CCGGCGGCAC CGTGAAGACG CTGTCGCAGC TCAACGTCAA GCTGGCGACC AACAACATGA TCGCGCAGAT CGACGCCAAC GGAAAGCTGA CGATCTCGTC GAACAACGAC TACGCCTCCG CGACGCTCGG CTCGACCACG GACGGCGGCA CGCTCGGCGG CACCATCACC GCGACGCTGA CCTTCTCGAC GCCGAACCCG CCGGAACCGG ACGTCACCGC GCAGGTGGCG CGCGCCAAGC TGGTCGAACA GTACAACAAC GTCATCCAGC AGATCACCAC GACGTCGCAG GACGCGTCGT TCAACGGCGT CAATCTGCTC AACGGCGATA CGCTGAAGCT GGTGTTCAAC GAGACCGGCA AGTCGACGCT GAACATCGTC GGCACCGCGC TGAGCCCGGC GGCGCTCGGC CTGCCGACGC TGGTGTCGGG CGTCGACTTC ATCGACAACG CCTCGACCAA CAAGACGCTG GCCTCGCTCA ACACCGCGGC GACCACGCTG CGGTCGCAGG CGTCGTCCTA CGGTTCCAAC CTGTCGATCG TGCAGATCCG GCAGGACTTC GCCAAGAACC TGATCAACGT GCTGCAGACC GGCTCGTCGA ACCTGACGCT GGCCGACACC AACGAGGAAG CCGCCAACAG CCAGGCGCTG TCGACCCGCC AGTCGATCGC GGTCTCGGCG CTGGCGCTGG CCAACCAGTC GCAGCAGAGC GTGCTGCAGC TGCTGCGATA A
|
Protein sequence | MSDVVLSAAV RQNLLSLQST ADLLSTTQNR LASGKKVNTA LDNPTNFFTA AGLDSRASDI NNLLDGISNG VQILQAANTG ITSLNKLIDT AKSIANQALQ SNVGYSTKSN VSATIAGATP DDLRGTQTFA SATATSNVVY DGTAGGTNGV SLTDTLGGGV GSITGTNITK AVAADATATG GVLYTGTATA TATSADLISS LTNGSTVTPT GPQAGDIIVV NGKNITFTTT GSATADSNGN YTIGINQPIS ALLASIDTIN GNTSNPSVVD ANGHIQLHTG TNRSLSISDT SSGTVLAKLG FGSTVTVPLG TGAATAITAT TKLFNSVGGL GPAIADGTTL TVNGKSVTFK ASDPPSAAGL LAGSGVLGNI VTDTAGNSTI YMGTSNTYTS ATVGDVLTAI DLASGVKSAT IANGIATFAA NGTPSQISAG GAVTLQTSTG ADLSITGPAD FLSSLNLTAS TGPGPATLTA TRSTGAGTIG TLIEDGSTLN VNGNIITFKN APVPLASASH TGISGHVETD GLGNSTVYLQ GGTMADVLKA IDLATGVQTA TLSQTGATLT TQTGSANSSL SSGSLKVSTG SASDLTISGT GNAMLALGLA GNTGTSTEFK ASRSSGTGGV SGKTLSFTSF KGGTPVSVTF GDGTGGTVKT LSQLNVKLAT NNMIAQIDAN GKLTISSNND YASATLGSTT DGGTLGGTIT ATLTFSTPNP PEPDVTAQVA RAKLVEQYNN VIQQITTTSQ DASFNGVNLL NGDTLKLVFN ETGKSTLNIV GTALSPAALG LPTLVSGVDF IDNASTNKTL ASLNTAATTL RSQASSYGSN LSIVQIRQDF AKNLINVLQT GSSNLTLADT NEEAANSQAL STRQSIAVSA LALANQSQQS VLQLLR
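The gene and protein sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that display formatting, compute a GC fraction, and translate the coding sequence. It is an illustration only: the function names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). Because the gene sequence is shown in coding orientation (it begins with ATG), no reverse complement is taken here, even though the gene lies on the minus strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stop at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence from this record.
    prefix = "ATGTCTGATG TGGTTCTTTC"
    print(translate(prefix))    # MSDVVL, the start of the protein sequence
    print(gc_fraction(prefix))  # 0.4 for this short prefix only
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed above, and `gc_fraction` could be compared against the reported GC content of 67%.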