Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4452 |
Symbol | |
ID | 6412136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4781174 |
End bp | 4783840 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714334 |
Product | flagellin |
Protein accession | YP_001993423 |
Protein GI | 192292818 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.438814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGTA TCGTTCTATC CAACGCCGTT CGCCAGAATC TTTCTTCGCT CCAGGCCACG GCTGACTTGC TCGCCACCAC CCAAAGCCGC CTCTCGTCCG GCAAGAAGGT GAACACGGCG CTCGATAATC CGACTAACTT CTTCACCGCC GCTTCGCTCG ACAGCCGCGC CAGCGACATC AACAACCTCC TCGATGGCAT CGGCAACGGC GTGCAGATCC TGCAGGCCGC CAATACCGGC ATCACCTCGC TGAACAAGCT GGTGGACAGC GCCAAGTCGA TCGCCAACCA GGCCCTGCAG ACGACCTCCG GCTACGCCAC CAAGTCGAAC GTGTCGGCCA CCATCTCCGG CGCCACCGCT GACGACCTGC GCGGCACCCA GAGCTACTCG AACGCGGTTG CCACCGGCAA CGTGATCTTC GACGGCACCG CGGGTGGCAG CACCGCTGCG TCCGGCACCG ACACCCTCGG TGGCGCGATC GTCAGCATCG CGGCGGGTAC GGCTGTGACC GCTCTCGGCG CCGCTGACAA CACCGCGCTC GGCAGCGTTC TCAGCGTCGG CACCGCCGCC GCCACCGCGG GCGGCTCCAA CCTGATCAGC GATCTCACCA ACGGTTCGAC CACCACGGCG ACCGGTCCGG CTGCGGGCGA CTCGATCACG GTGAACGGCA AGACCATCAC CTTCACGACT GCCGGTGCCG CCAGCAAGGA CAGCAACGGC AACTACACGA TCGGTCTCGA CCAGACCCTG ACCAAGCTGG CCAACACGAT CGACGATATC AACGGCAACA CCGGCCATTC GTCGACCATC ACCGCCGGCA AGCTGGAACT GCACTCGGGC ACCAACAGCC CGCTGACGAT CGGCGACAAC GCCGGCGGCG CCGTGCTGGC CAAGCTCGGC CTGACCGCGC AGACCGTCGA CACCGCGGCT GCGACCGCCT CGGCCAACAT CTCGGCCACG ACGCAGCTGT TCAACACCCA TGGTGGCCTC ACCACCGCGG CGATCGCGGA CGGCACCCAG CTGACGGTCA ACGGCAAGAC CATTACGTTC AAGACCTCCG ACGCTCCGCA GGGCAATAAC ATCCCGACGG GCACCGGTGT TCTCGGCCGT ATCGGCACCG ACGGCAACGG CAATTCGACG ATCTATCTCG GCGACCAGAC CAAGTTCAGC AACGCGACCG TTGGTGACCT GCTGACCGCG ATCGATCTGG CCAACGGCGT CAAGTCGGCG ACCATCTCGT CGGGTGTCGC AACGATCAGC ACCAACTCCG GCCAGACTGC TTCGGACGTC ACCGGTGGTA TCACCACCAT CCGCAGCTCG ACCGGCGCCG ACCTCAACGT CACCGGCTGG ACCGACCTGT TCAAGAACCT CGGTCTGACC AGCGCTACCG GTACCGGTCC GCTGACCCTC ACCAAGCAGC GCACCACCAG CGGCACCACG ATGGGCACGC TGATCGCGGA CGGCTCCACG CTGAACGTGA ACGGCAAGAC CATCACCTTC AAGAACGCCG CTGTTCCGAC TGCGTCGTCG ACCCACCAGG GCATCTCCGG CAACGTCGAG ACCGATGGCC AGGGCAATTC GACCGTGTAC CTGCAGAAGG GCACCATCGA TGACGTCCTG AAGGCCATCG ACCTCGCCAC CGGCGTCCGC ACGGCTTCGC TGGGTGTCTC CGGTGCCACG ATCTCGACTG CCAACGGCAC GGCCAACTCG TCGATCACGA GCGGCTCGCT GAAGCTGTCG ACCGGACTTG CCTCCGACCT TAGCATCACC GGCACCGGCA ACGCGCTGGC TGCCCTCGGC CTCACCGGCC CGAGCGGCAT CTCCACCTCG TTCACCTCGG CTCGTGGCGC TTCGGCCGGC AGCCTGAACG GCAAGACGCT GACCTTCACC TCCTTCAACG GCGGTGCGGC CACCAACGTC ACCTTCGGCG ACGGCACCAA CGGCACCGTC AAGTCGCTCG CTCAGCTGAA CGCTGCGCTG GCGTCCAACA ACCTGACGGC GTCGATCGAC AACGCCTCCG GCAAGCTCAC GATCGCAGCG TCGAACGACT ACGCCTCCCA CACTCTGGGT GGCTCGGACG GCGGTGTGAT CGGCGGTACC CTGGCTTCGA CCCTGACCTT CTCGGTGCCG AACGCGCCGG TGGTCGACGT CAACGCCCAG ACCACCCGCG CCGGCCTGGT CAAGCAGTTC AACGACGTGC TCGACCAGAT CAAGACCACG GCTCAGGACG CTTCGTTCAA CGGTGTGAAC CTGCTGAACG GCGACACCCT GAAGCTGGTG TTCAACGAAA CCGGCAAGTC GACGATCTCG ATCCAGGGCG TCACCTTCAA CCCGACCGGC CTTGGCCTGT CGAACCTGAG CTCTGGCGTC GACTTCATCG ACAACAACGC CACCAACGCC GTGCTGAGCA AGCTGAGCAC CGCTTCGACC GCCCTGCGGT CGCAGGCCTC CGCGTTCGGT TCGAACCTGT CGATCGTGCA GGCCCGTCAG GACTTCTCGA AGAGCCTGAT CAACGTGCTG CAGACCGGTT CGTCGAACCT CACGCTGGCC GACACCAACG AGGAAGCGGC GAACAGCCAG GCGCTGACGA CCCGCCAGTC GATCGCGGTG TCCGCGCTGT CGCTGGCCAA CCAGTCTCAG CAGGGCGTGC TGCAGCTCCT CCGCTAA
|
Protein sequence | MSGIVLSNAV RQNLSSLQAT ADLLATTQSR LSSGKKVNTA LDNPTNFFTA ASLDSRASDI NNLLDGIGNG VQILQAANTG ITSLNKLVDS AKSIANQALQ TTSGYATKSN VSATISGATA DDLRGTQSYS NAVATGNVIF DGTAGGSTAA SGTDTLGGAI VSIAAGTAVT ALGAADNTAL GSVLSVGTAA ATAGGSNLIS DLTNGSTTTA TGPAAGDSIT VNGKTITFTT AGAASKDSNG NYTIGLDQTL TKLANTIDDI NGNTGHSSTI TAGKLELHSG TNSPLTIGDN AGGAVLAKLG LTAQTVDTAA ATASANISAT TQLFNTHGGL TTAAIADGTQ LTVNGKTITF KTSDAPQGNN IPTGTGVLGR IGTDGNGNST IYLGDQTKFS NATVGDLLTA IDLANGVKSA TISSGVATIS TNSGQTASDV TGGITTIRSS TGADLNVTGW TDLFKNLGLT SATGTGPLTL TKQRTTSGTT MGTLIADGST LNVNGKTITF KNAAVPTASS THQGISGNVE TDGQGNSTVY LQKGTIDDVL KAIDLATGVR TASLGVSGAT ISTANGTANS SITSGSLKLS TGLASDLSIT GTGNALAALG LTGPSGISTS FTSARGASAG SLNGKTLTFT SFNGGAATNV TFGDGTNGTV KSLAQLNAAL ASNNLTASID NASGKLTIAA SNDYASHTLG GSDGGVIGGT LASTLTFSVP NAPVVDVNAQ TTRAGLVKQF NDVLDQIKTT AQDASFNGVN LLNGDTLKLV FNETGKSTIS IQGVTFNPTG LGLSNLSSGV DFIDNNATNA VLSKLSTAST ALRSQASAFG SNLSIVQARQ DFSKSLINVL QTGSSNLTLA DTNEEAANSQ ALTTRQSIAV SALSLANQSQ QGVLQLLR
|
| |