Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2418 |
Symbol | |
ID | 5136124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2571707 |
End bp | 2573686 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640533870 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_001218318 |
Protein GI | 147673102 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00134538 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTAT TGGCACGTCT TTTCTCTACA GCACCACATC ATCCGACGCA ACCAGCGGAG GTTAAGCCAG ACCCGCTGAT GCAATCTCCG GTAACCACCG AGACGACGGC ATTACAGCCA GAACCACAGG TTGCGCCTTA TCAAACTGAC CTTGCGCTCA CCACGCCAGA GTCAGACCTC ACACCCCAAG TCAATCAAGC TTTTGTTCAG GTGATTCGTG AACATCTGGC ACTGTTGGAA TGTGAGCCCA ACGGCACGAT TTGCTATGCC AGTGATGCGT TTGCTCATCT CTGTCGGGTA TCGGCTGAAG CGATGGTCGG AGCGGATTTT GCGAACTTAT GGCGCACTCA TCAGCAGCCA TCTGTGCAGC GCTTGCTGCA GGATGCTAAA AAAGGACACC CTGTTTCTGC CGAATTACAG CTCAGCCGCT CGCAAGAGGC AATCTGGATA AAAGCGGATC TGTATCCCAT CAAGGCGATA AACGGTCAAT TACAAAACGT GGTGGTACTG TTGCAAGATA TCACTGCGGC CAAGCTGGAA AAAATTGACC GTAGTGGTCA AATGAATGCG GTCAATCTCA CCCAAGCCGT GATTGAATTT ACGCTCGATG GCACCATTCT GACTGCAAAT CAGAATTTTC TGCAAACGGT GGGCTATCAG TTGGATGAGA TCCAAGGGCG GCACCACAGC ATGTTTGTTG ATGAGCAGTA TAAACAGAGT CAAGAGTACC AACATTTCTG GCAGCGTTTG CGCTCCGGCG AATTCTTTGT CGACGAATAC AAACGTTTCG GCAAAGGGGG GAAAGAGATC TGGATCCAAG CCAGCTATAA CCCGATTATG GACAGCGAGG GTAAACCGTA TAAGGTTGTC AAATATGCCA CCAACGTGAC TCAGCGTAAG ATGGTGGTCA ATGAAGTCAA ACGCGTCATG ACGGCGCTCT CCAGCGGCGA TCTCAGTGCA CAACTCACGC ATCCTTTTGA AGGTGAATTT GCTGAGCTCG GTGAGGTGAT CAGTCAATTC ATTGTCACAT TGCGTCAAAT CATTACTGAC ATCAACAGCG TCGCCGCGAC CATCAAGCTA GCAGCGACGG AGATTTCCAA TGGCAATACC GACCTCTCTA GCCGAACCGA GCAACAAGCC TCCAATCTGG AGCAAACGGC CTCCAGCATG GAAGAGCTAA ACAGCACGGT GCGACAAAAC TCGGATAACG CCATGCAGGC GAACATCCTT GCGGGTAAAG CCACGGAAAT CGCCGCCAGC GGTGGAGAAC TGATTGAGCA AGTGGTAGTG ACTATGGCCT CGATTAACGA ATCCGCGCAA AAGATCGCCG ATATTATAGG TGTGATCGAT GGTATTGCTT TCCAAACCAA TATTCTGGCA CTCAATGCTG CGGTAGAGGC CGCTAGAGCC GGTGAACAAG GCCGCGGATT TTCCGTAGTC GCCTCCGAAG TACGCTCACT CGCACAGCGC TCAGCCAATG CAGCGAAAGA TATTAAGGCG CTCATCTCAG ATTCCGTGAG TAAAATCAGC AATGGGAATG AATTGGTGGA TCGCTCCGGC AGCACCATGA AAGACATAGT CGTGTCGATC AAGCGAGTCC ATGATTTGAT GGCCGATATC GCCTCAGCGT CAGCCGAGCA GGCAACGGGG ATCAATGAAG TGAACCAAGC AGTCAATCAG ATGGATGAGA TGACCCAACA AAATGCCGCA CTCGTCGAAG AGGCAGCCGC GGCTTCAGAA AGCCTATTAG CACAGGCCGA GCAGCTCTAT GACCATGTCG CTATGTTCAG ATTGCCAGAT CAGGACACGA GCGCCCCATC ACTGTTGAAA GCGGTCAATA AGCGCCCGCA ATCCGCACCA GTGACTCGAC ATCCGGCCAG CCACATCGCT AAAACGCCAG CAAAAATCAC GGCCAAAGCC AGCTCGAGGG CACAACCCGT TATGCAGGTC GCGCACGATG AAGAATGGGA GAGTTTTTGA
|
Protein sequence | MGLLARLFST APHHPTQPAE VKPDPLMQSP VTTETTALQP EPQVAPYQTD LALTTPESDL TPQVNQAFVQ VIREHLALLE CEPNGTICYA SDAFAHLCRV SAEAMVGADF ANLWRTHQQP SVQRLLQDAK KGHPVSAELQ LSRSQEAIWI KADLYPIKAI NGQLQNVVVL LQDITAAKLE KIDRSGQMNA VNLTQAVIEF TLDGTILTAN QNFLQTVGYQ LDEIQGRHHS MFVDEQYKQS QEYQHFWQRL RSGEFFVDEY KRFGKGGKEI WIQASYNPIM DSEGKPYKVV KYATNVTQRK MVVNEVKRVM TALSSGDLSA QLTHPFEGEF AELGEVISQF IVTLRQIITD INSVAATIKL AATEISNGNT DLSSRTEQQA SNLEQTASSM EELNSTVRQN SDNAMQANIL AGKATEIAAS GGELIEQVVV TMASINESAQ KIADIIGVID GIAFQTNILA LNAAVEAARA GEQGRGFSVV ASEVRSLAQR SANAAKDIKA LISDSVSKIS NGNELVDRSG STMKDIVVSI KRVHDLMADI ASASAEQATG INEVNQAVNQ MDEMTQQNAA LVEEAAAASE SLLAQAEQLY DHVAMFRLPD QDTSAPSLLK AVNKRPQSAP VTRHPASHIA KTPAKITAKA SSRAQPVMQV AHDEEWESF
|
| |