Gene VC0395_A2418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2418 
Symbol 
ID5136124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2571707 
End bp2573686 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content52% 
IMG OID640533870 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001218318 
Protein GI147673102 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00134538 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTTAT TGGCACGTCT TTTCTCTACA GCACCACATC ATCCGACGCA ACCAGCGGAG 
GTTAAGCCAG ACCCGCTGAT GCAATCTCCG GTAACCACCG AGACGACGGC ATTACAGCCA
GAACCACAGG TTGCGCCTTA TCAAACTGAC CTTGCGCTCA CCACGCCAGA GTCAGACCTC
ACACCCCAAG TCAATCAAGC TTTTGTTCAG GTGATTCGTG AACATCTGGC ACTGTTGGAA
TGTGAGCCCA ACGGCACGAT TTGCTATGCC AGTGATGCGT TTGCTCATCT CTGTCGGGTA
TCGGCTGAAG CGATGGTCGG AGCGGATTTT GCGAACTTAT GGCGCACTCA TCAGCAGCCA
TCTGTGCAGC GCTTGCTGCA GGATGCTAAA AAAGGACACC CTGTTTCTGC CGAATTACAG
CTCAGCCGCT CGCAAGAGGC AATCTGGATA AAAGCGGATC TGTATCCCAT CAAGGCGATA
AACGGTCAAT TACAAAACGT GGTGGTACTG TTGCAAGATA TCACTGCGGC CAAGCTGGAA
AAAATTGACC GTAGTGGTCA AATGAATGCG GTCAATCTCA CCCAAGCCGT GATTGAATTT
ACGCTCGATG GCACCATTCT GACTGCAAAT CAGAATTTTC TGCAAACGGT GGGCTATCAG
TTGGATGAGA TCCAAGGGCG GCACCACAGC ATGTTTGTTG ATGAGCAGTA TAAACAGAGT
CAAGAGTACC AACATTTCTG GCAGCGTTTG CGCTCCGGCG AATTCTTTGT CGACGAATAC
AAACGTTTCG GCAAAGGGGG GAAAGAGATC TGGATCCAAG CCAGCTATAA CCCGATTATG
GACAGCGAGG GTAAACCGTA TAAGGTTGTC AAATATGCCA CCAACGTGAC TCAGCGTAAG
ATGGTGGTCA ATGAAGTCAA ACGCGTCATG ACGGCGCTCT CCAGCGGCGA TCTCAGTGCA
CAACTCACGC ATCCTTTTGA AGGTGAATTT GCTGAGCTCG GTGAGGTGAT CAGTCAATTC
ATTGTCACAT TGCGTCAAAT CATTACTGAC ATCAACAGCG TCGCCGCGAC CATCAAGCTA
GCAGCGACGG AGATTTCCAA TGGCAATACC GACCTCTCTA GCCGAACCGA GCAACAAGCC
TCCAATCTGG AGCAAACGGC CTCCAGCATG GAAGAGCTAA ACAGCACGGT GCGACAAAAC
TCGGATAACG CCATGCAGGC GAACATCCTT GCGGGTAAAG CCACGGAAAT CGCCGCCAGC
GGTGGAGAAC TGATTGAGCA AGTGGTAGTG ACTATGGCCT CGATTAACGA ATCCGCGCAA
AAGATCGCCG ATATTATAGG TGTGATCGAT GGTATTGCTT TCCAAACCAA TATTCTGGCA
CTCAATGCTG CGGTAGAGGC CGCTAGAGCC GGTGAACAAG GCCGCGGATT TTCCGTAGTC
GCCTCCGAAG TACGCTCACT CGCACAGCGC TCAGCCAATG CAGCGAAAGA TATTAAGGCG
CTCATCTCAG ATTCCGTGAG TAAAATCAGC AATGGGAATG AATTGGTGGA TCGCTCCGGC
AGCACCATGA AAGACATAGT CGTGTCGATC AAGCGAGTCC ATGATTTGAT GGCCGATATC
GCCTCAGCGT CAGCCGAGCA GGCAACGGGG ATCAATGAAG TGAACCAAGC AGTCAATCAG
ATGGATGAGA TGACCCAACA AAATGCCGCA CTCGTCGAAG AGGCAGCCGC GGCTTCAGAA
AGCCTATTAG CACAGGCCGA GCAGCTCTAT GACCATGTCG CTATGTTCAG ATTGCCAGAT
CAGGACACGA GCGCCCCATC ACTGTTGAAA GCGGTCAATA AGCGCCCGCA ATCCGCACCA
GTGACTCGAC ATCCGGCCAG CCACATCGCT AAAACGCCAG CAAAAATCAC GGCCAAAGCC
AGCTCGAGGG CACAACCCGT TATGCAGGTC GCGCACGATG AAGAATGGGA GAGTTTTTGA
 
Protein sequence
MGLLARLFST APHHPTQPAE VKPDPLMQSP VTTETTALQP EPQVAPYQTD LALTTPESDL 
TPQVNQAFVQ VIREHLALLE CEPNGTICYA SDAFAHLCRV SAEAMVGADF ANLWRTHQQP
SVQRLLQDAK KGHPVSAELQ LSRSQEAIWI KADLYPIKAI NGQLQNVVVL LQDITAAKLE
KIDRSGQMNA VNLTQAVIEF TLDGTILTAN QNFLQTVGYQ LDEIQGRHHS MFVDEQYKQS
QEYQHFWQRL RSGEFFVDEY KRFGKGGKEI WIQASYNPIM DSEGKPYKVV KYATNVTQRK
MVVNEVKRVM TALSSGDLSA QLTHPFEGEF AELGEVISQF IVTLRQIITD INSVAATIKL
AATEISNGNT DLSSRTEQQA SNLEQTASSM EELNSTVRQN SDNAMQANIL AGKATEIAAS
GGELIEQVVV TMASINESAQ KIADIIGVID GIAFQTNILA LNAAVEAARA GEQGRGFSVV
ASEVRSLAQR SANAAKDIKA LISDSVSKIS NGNELVDRSG STMKDIVVSI KRVHDLMADI
ASASAEQATG INEVNQAVNQ MDEMTQQNAA LVEEAAAASE SLLAQAEQLY DHVAMFRLPD
QDTSAPSLLK AVNKRPQSAP VTRHPASHIA KTPAKITAKA SSRAQPVMQV AHDEEWESF