Gene Information |
Locus tag | RPB_3814 |
Symbol | |
ID | 3911617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4352087 |
End bp | 4354753 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885715 |
Product | flagellin |
Protein accession | YP_487419 |
Protein GI | 86750923 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
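The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (so length = end - start + 1) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4352087, 4354753)
print(length)                     # 2667
print(protein_length_aa(length))  # 888
```

Under these assumptions the computed values agree with the Gene Length (2667 bp) and Protein Length (888 aa) fields in the record.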
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGTA TCGTTCTCTC GAATGCCGTC CGCCAGAACC TCTCGTCGCT TCAGGCCACG GCAGACTTGC TCGCCACCAC CCAGAGCCGG CTCTCGTCCG GTAAGAAGGT GAACTCGGCG CTCGACAATC CGACCAACTT CTTCACCGCC GCCTCGCTCG ACGCCCGCGC CAGCGACATC AACAACCTGC TCGACGGCAT CAGCAGCGGC GTGCAGATCC TGCAGGCCGC CAACACCGGT ATCACTTCGC TGACGAAGCT CGTCGACAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG ACGACCTCGG GCTATGCCAC GAAGTCGAAC GTGTCCGCCA CCATCTCCGG CGCCACCGCC GAGGATATCC GCGGCACCCA GAGCTTCGAC AATGCGGTTG CGACCGGCAA CGTGATCTTC GACGGCACCA CCGGCGGCAC CACGGCTGCG GCCGGCACCG ATACGCTCGG TGGCGCGATC GTCAGCATCG CGGCTTCGGC GGCTGTGACC GTCCTCGGTG CGGTCGACGC GACGGCGACC GGCGCCGTGC TCTCGGTCGG CACCGCCGCC GCGACCGCGG CCGGCACCAA CCTGATCAGC GAACTGACCA ACGGTTCGAC CGTGACGGCG ACCGGCCCTG CCGCGGGTGA CTCGATCACC GTCAACGGCA AGACCATCAC CTTCACGACC GCCGGCGCGT CCAGCAAGGA CTCCTCGGGC AACTACACGA TCGGTCTCGA CCAGACCGTC AACTCGCTGC TCGACACCAT CGACACCGTC AACGGCAACA CCGGCCTGTC CTCGGCCGTT ACCGCCGGCC GGCTCGAACT GCACTCGGGC ACCAACAGCC CGCTGACCGT TGGTGATAAC GCCGGTGGCG CCGTTCTGGC CAAGCTCGGC CTGACCGCGC AGACCGTCGA TACCGCTGCC GCCACCGCGT CGGCGAACAT CTCGGCGAGC ACGCAGCTGT TCAACACCCA TGGTGGCCTG ACCACCACGG CGATCGCCGA CGGCACGACC TTGTCGGTCA ACGGCAAGAC CATCACCTTC AAGACCGCCG ACGCGCCGCA GGGCAACAAC ATCCCGACCG GCACCGGCGT CCTCGGCCGC ATCGGCACCG ACGGCAACGG CAACTCGACG ATCTATCTCG GCGACCAGAC CAAGTTCACC AACGCGACGG TGGGCGATCT GTTGACCGCG ATCGATCTCG CGAACGGCGT CAAGGCGGCG TCGATCTCGT CGGGTGTCGC GACCATCAGC ACCAACTCCG GCCAGACGGC TTCCGCCGTC GCCGCCGGCA TCACCACCAT CCAGAGCTCG ACCGGTGGCG ATCTGAACGT CACGGCCTTC ACCGACCTGT TCAAGAACCT CGGCCTGACC ACTTCGAGTG GTACTGGTCC GCTGACCCTG ACCAAGCAGC GCACCACCAG CGGCACCACG ATGGGCACGC TGATCGCCGA CGGCTCGACC TTGAACGTGA ACGGTAAGAC CATCACCTTC AAGAACGCCG CGGTTCCGAC CGCGTCGTCG AGCCACACCG GCATCTCCGG CAACGTCGAA ACCGACGGCA GCGGCAATTC GACCGTGTAC CTGCAGAAGG GTACGCTCGA TGACGTGCTG AAGGCGATCG ACCTCGCCAC CGGCGTCCGC GTCGCCACGC TCGGGATCTC CGGCGCGACG ATTGCGACCG CCAATGGCTC CGCCAACTCG TCGATCACCA GCGGTTCGCT GAAGCTGTCG ACCGGCCTCG CGTCGGATCT GACCATCAAC GGCACCGGCA ATGCGCTGGC CGCCCTCGGC 
CTCACCGGCC CGAGCGGGAC CTCGACCTCG TTCACCGCGA CCCGCGGCGT TGCCGCCGGC AGCCTGAACG GCAAGAGCTT GACCTTCTCC TCCTTCAATG GCGGTTCGGC GGTCAATGTC ACGCTCGGCG ATGGCAGCAA CGGTACCGTC AAGTCGCTGG CCCAGCTCAA CGTCGCGCTG GCTGCCAACA ACCTGACGGC GTCGATCGAC AACGCCTCCG GCAAGCTGAC GATCGCAGCG TCGAACGACT ACGCCTCCCG CACGCTGGGC GGCGCTGACG GCGGCGTGCT CGGCGGCACT CTGGCCTCGC AGTTGACCTT CACCGTGCCG ACCGCACCGG TGGCCGACGT CAACGCCCAG AACACCCGCG CCGGCCTGGT CAAGCAGTTC AACGACGTGC TCGACCAGAT CAAGACCACC GCGCAGGACT CTTCGTTCAA CGGCGTCAAC CTGCTGAACG GCGACAACCT GAAGTTGGTG TTCAACGAAA CCGGCAAGTC CACGATCTCG ATCCAGGGCG TGACCTTCAA CCCGACCGGC CTCGGCCTGT CGACCCTGGC CTCGGGCACG GACTTCATCG ACAACAACGC CACCAACTCG GTGTTGACCA AGCTGAGTGC CGCCTCGACC GCGCTGCGGT CGCAGTCGTC CGCTTTCGGT TCGAACCTGT CGATCGTGCA GGCCCGTCAG GACTTCTCGA AGAGCCTGAT CAACGTGCTG CAGACCGGCT CGTCCAACCT GACGCTGGCC GACACCAACG AAGAAGCGGC GAACAGCCAG GCGCTGACGA CCCGTCAGTC GATCGCGGTC TCGGCGCTGT CGCTGGCCAA CCAGTCGCAG CAGAGCGTGC TCCAGCTGCT GCGCTAA
Protein sequence | MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNSA LDNPTNFFTA ASLDARASDI NNLLDGISSG VQILQAANTG ITSLTKLVDS AKSIANQALQ TTSGYATKSN VSATISGATA EDIRGTQSFD NAVATGNVIF DGTTGGTTAA AGTDTLGGAI VSIAASAAVT VLGAVDATAT GAVLSVGTAA ATAAGTNLIS ELTNGSTVTA TGPAAGDSIT VNGKTITFTT AGASSKDSSG NYTIGLDQTV NSLLDTIDTV NGNTGLSSAV TAGRLELHSG TNSPLTVGDN AGGAVLAKLG LTAQTVDTAA ATASANISAS TQLFNTHGGL TTTAIADGTT LSVNGKTITF KTADAPQGNN IPTGTGVLGR IGTDGNGNST IYLGDQTKFT NATVGDLLTA IDLANGVKAA SISSGVATIS TNSGQTASAV AAGITTIQSS TGGDLNVTAF TDLFKNLGLT TSSGTGPLTL TKQRTTSGTT MGTLIADGST LNVNGKTITF KNAAVPTASS SHTGISGNVE TDGSGNSTVY LQKGTLDDVL KAIDLATGVR VATLGISGAT IATANGSANS SITSGSLKLS TGLASDLTIN GTGNALAALG LTGPSGTSTS FTATRGVAAG SLNGKSLTFS SFNGGSAVNV TLGDGSNGTV KSLAQLNVAL AANNLTASID NASGKLTIAA SNDYASRTLG GADGGVLGGT LASQLTFTVP TAPVADVNAQ NTRAGLVKQF NDVLDQIKTT AQDSSFNGVN LLNGDNLKLV FNETGKSTIS IQGVTFNPTG LGLSTLASGT DFIDNNATNS VLTKLSAAST ALRSQSSAFG SNLSIVQARQ DFSKSLINVL QTGSSNLTLA DTNEEAANSQ ALTTRQSIAV SALSLANQSQ QSVLQLLR
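Both sequences are displayed in blocks of ten characters separated by spaces, so the layout has to be stripped before any analysis. The following is a minimal sketch, with hypothetical helper names, of removing that whitespace and computing a GC percentage for a nucleotide string. It is demonstrated on only the first two blocks of the gene sequence; running it on the full sequence would be the way to compare against the GC content reported in the record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the spaces/newlines used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = strip_blocks(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in dna)
    return 100.0 * gc / len(dna)


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGTCAGGTA TCGTTCTCTC"
print(len(strip_blocks(first_blocks)))  # 20
print(gc_percent(first_blocks))         # 45.0
```

The short fragment is not representative of the whole gene; it is used here only to keep the example self-contained.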