Gene VC0395_A1141 details

Gene Information

Locus tag: VC0395_A1141
Symbol:
ID: 5136026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1201076
End bp: 1203097
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 48%
IMG OID: 640532599
Product: methyl-accepting chemotaxis protein
Protein accession: YP_001217087
Protein GI: 147674542
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
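
The gene and protein lengths listed above follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein (the variable names are illustrative only):

```python
# Coordinates copied from the record above.
start_bp = 1201076
end_bp = 1203097

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2022
print(protein_length)  # 673
```

Both values agree with the Gene Length and Protein Length fields of the record.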


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0337081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTGA CAAAAAATTT ATCCTTAACC CAAACACTCG GTGCGGTGTT TCTATGCATA 
ACCATACTGA TGATTAGCTT ATCTGTCACC AGTTTACGTG GCATAGAGCG GGTTGGCGCT
CAGTTTAATC AACTGTCTGA ACAAGCTTTG CCCTTTGCGC TTAATAATGC AGCGTTAACG
CAAAATTTTC TCGAACAAGT GAAGTACCTA GGTTATGGCA CTCGTAGCCA ATCTGAGCAA
GAGCTCAATC AAGTGCTCAA TGAATGGCAA AAGCTGGATG CTCAAGCGGG AGATGAAATA
ACAAGATTGC AGCAGAACGT ACAATTGCTC TCTAGCGCTG AAGCGGTTCA ACAAGCCGAG
CAGTTACAGC GAGAGATCCT TCATTTTCAA CAGCTAGCCC AATCGATTCT CAAACTTCAA
CAACTGCAAT TGAGTAAAAC AGCACAAATC AGTGAGCAAG CGAAACAATT TCGTTATGGC
TTGAGTTCAA TCGGACCTGA AATGGGCCGT ATCGCCTCGT TTTTAGCGGT GGATAACCCA
GAAGCCATGG ATGCAGCGAA CCGATTTACC GCCAGTGCCA GCGCAATGGA AAGTGCTTTT
TTATTGCTGT TTATCGAAGA AGAGATGTCA GCAGCGCAAA AATACCGCCA AGAGTTAAAA
AATCGGGTAG CGGGATTGGA ACTGGCGTTT GATGACTTCA AAGAGTGGTA CCCAGAAATT
AAAGATTACG CGAGCTTAAC TGCCCCCTAT GAAATGGTGT TAGCAGGCTT TCAAGCGCAA
GCCGTGATAG AGCAGATCAT TAATAAACTG GAAGACGCGC AACAACAAAA TAAGGATTTT
GCAAGCGCTG CAGAGGTTGC GCAGCAACTT GTCACGCAGC TCAATCAGTG GTCAACATTG
GCTCAACAAC ACATTGTGCA GGGTAAGCAA GAGGTAACAT CAACCATTTC TGCCGTCACC
CTCACACAGC AAATCAGCGG CACTCTCTTG GTGCTGGCGA TTTTGGCGGT GTGGTTTGGC
TTGCGCCGTT GGATAGGGCG AGCGCTGAAC AACATCACTC GTCATCTCGC GCAGTTAACA
CAACATAAGC TTAACCACAG ATTAGATTTA GTCGGGCCGC AAGATTTTCA AAATGTCGCA
GCGCAACTCA ATCAAGTGAT CGTATCAACG CATGAGTCGC TCGCATTAGT CACGCGCAAC
TGCGAAACGC TCTACCAAAC GGCTGAGCTG AGCCATGGTT CCGCAGAACA ATCAAACCAA
AGCTTAGCGG CGCAAAATCA AGCCTTGTTG ACCATGGCTG CAACCATTAA TCAACTGGAT
GCATCAATTC GCGAAATCGC CGGAGTGAGT CACGATTCGT ACACGGATTC TGTGGAAGCG
GCGGAACATT CCGCGCAAGG TGTGAAGGTG ATAGAGCAGA ATCAGCAGCG TTTACAAGCA
TTAGAAACCA CCTTAGCGGT TAATGATGCG GCGATGTCTG AACTCAATCA GCGCGTAACC
AGTATTCGTG AAATGGTGGA TATGATCAGT GGCATTGCCG ATAGCACCAA TTTGTTAGCG
CTGAATGCGG CGATTGAAGC CGCACGAGCC GGAGAGCAAG GGCGCGGTTT TGCCGTAGTG
GCTGACGAAG TTCGTAAATT GGCCAGTGAT ACCAGTAAAC AGACCACCAA TATTCGTGAC
ATGATGAATG AGTTGGTGAC TGCGGCCAGT AAATCACGCC AAGCGGTGGA TGAATCGCGT
AAAGAGATGG TCACCGCCCT ACAATCGAGT GAAGAAGTAA AAAGTACCTT TATGCAGATT
GAGCGTGCAG TGGCACATAT TCGCACTCGA GTCGAGCAGA TCACTCAAGC GACCGAAGAA
CAAAAGCGAG CCACGGCGGA TGTAAACAAA GCGGTAGCGC AAATTTCCGA ACAAGGGCAA
GAAACCAAAC GTCAGTTGGA TGCCATGTTG GAAAGCGCGG AACAAGTCGC AGAGATTGCT
GGTCATCAAC AAGCGATGTT GCATAAGTAT GAGTTGAACT GA
 
Protein sequence
MALTKNLSLT QTLGAVFLCI TILMISLSVT SLRGIERVGA QFNQLSEQAL PFALNNAALT 
QNFLEQVKYL GYGTRSQSEQ ELNQVLNEWQ KLDAQAGDEI TRLQQNVQLL SSAEAVQQAE
QLQREILHFQ QLAQSILKLQ QLQLSKTAQI SEQAKQFRYG LSSIGPEMGR IASFLAVDNP
EAMDAANRFT ASASAMESAF LLLFIEEEMS AAQKYRQELK NRVAGLELAF DDFKEWYPEI
KDYASLTAPY EMVLAGFQAQ AVIEQIINKL EDAQQQNKDF ASAAEVAQQL VTQLNQWSTL
AQQHIVQGKQ EVTSTISAVT LTQQISGTLL VLAILAVWFG LRRWIGRALN NITRHLAQLT
QHKLNHRLDL VGPQDFQNVA AQLNQVIVST HESLALVTRN CETLYQTAEL SHGSAEQSNQ
SLAAQNQALL TMAATINQLD ASIREIAGVS HDSYTDSVEA AEHSAQGVKV IEQNQQRLQA
LETTLAVNDA AMSELNQRVT SIREMVDMIS GIADSTNLLA LNAAIEAARA GEQGRGFAVV
ADEVRKLASD TSKQTTNIRD MMNELVTAAS KSRQAVDESR KEMVTALQSS EEVKSTFMQI
ERAVAHIRTR VEQITQATEE QKRATADVNK AVAQISEQGQ ETKRQLDAML ESAEQVAEIA
GHQQAMLHKY ELN
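
The sequences above are printed in blocks of ten characters, so they need to be cleaned before use. The following Python sketch shows one way to strip the spacing, compute GC content, and translate the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of the source database. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, and that the input contains only A, C, G and T.

```python
# Standard codon assignments in TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGGCTTTGA CAAAAAATTT ATCCTTAACC"
print(translate(first_bases))  # MALTKNLSLT
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein fields of the record to be checked in the same way.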