Gene VC0395_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0602 
Symbol 
ID5134945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp655093 
End bp656703 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content50% 
IMG OID640530924 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001215441 
Protein GI147672450 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATACAT TTGCGCTCAT AAAAATAACC GAAGAACAAC AAGCCGAAAT GTCAGCCTAT 
ACTCCCTCTG CTCAACAAGA AGTCCTTGTC GGCGACCATG ACCAATTGGT ATCAACGACG
GATTTAAAAG GTGTCATCAC CTACTGTAAT GATACATTTT GTCGTATTGC TGGGTTCCAA
GCCGATGAGC TGCTAGGAAA AAATCACAAT ATCGTTCGCC ATGCTTCGAT GCCGAAAGCG
GCGTTTGCCG ATATGTGGCA CCACCTTAAG CAAGGGCATG CTTGGCGCGG AATCGTGAAG
AATCGCACTA AATCCGGCGG GTTTTACTGG GTTGACGCCT ATGTAACGCC GATTTATCAA
CAAGGCCAAC TTACAGGTTA TCAATCGGTT CGGGTAAAAG CTGAGCGAAA GTGGGTAGAG
ATTGCCACCA AAGCCTACCA AGCGTTATTA GTGGCTGAAA AGGCCGGTAA AAAAATCCAA
TTTAAGCTGC ATACTTCGCT GCGTTACGCC TTGTTGCTCG GTGCATTGAT GTCACCAGCA
TTGGCACATG GGTTTCAAGC TCCTGAACAG TGGCAATGGT TAGCCAGTCT TTTACCAGCG
GGAGTATTAG GTTTGTTGTT TCGGCAAGAG CTAGTGCGTA CCCCGCAACA GCTCAAACAG
TGGCAAAACG AGTATGACAG TATCAGCCGT TTGATCTATT CGGGCGCTGA TGCTTTTTCT
GTTGCCGACT ACCATCTAAA AATGGCTTCC GCTCGGATCC GTACCATTCT CGGTAGAATG
ATGGACTCGG CGCGTCCATT GGGTGAACTG GCCAATCAGC TGCATTTGAC GACGCAAGAA
GTGCATCAAG CGTTGGCGGC ACAAAACAGC AACATCCAAG CCGTAACGCA AGCCACCGAT
GCTGTCGAGA GCGCCGCTGA GCGGGTTTCG AGCCATACTC ACTCGGCTCA TCAATTGATT
GATCAAGTGC AAAATCATTG TGCCGAGACC AAACACAGCA TCAATGTCAC CCATCAGAAT
TTACAGCGAC TGGCTACGCA AGCTGAGAGT GCTGCCTTGA CGACACTAAA ACTGAGCGAT
CAAGCGCAGC AGGTCGGGCA ATTAATGACT GAAATTGGCG GGATTGCTGA GCAAACCAAT
TTACTGGCGC TGAATGCTGC GATTGAAGCC GCTAGAGCGG GTGAACAAGG TCGAGGGTTC
GCCGTTGTGG CGGATGAAGT GCGCGCATTA TCGGCTCGTA CTCAGCGTGC CACACAGCAG
ATCCAAACCA GCATTGATAC TATGTTGTCC ACTATTGAAG CGTGGCGAGG GGATATCACT
GCAAGTCGTG ATCAAACCGA GCAGTGCGCT CAAGATGCGA ATACCACGCT GCAGCAGTTG
CAGGATGTAG AATGCGTAAT GAGTGACATG CTGCGAGTTA TCGGTGAGGT CGCGAGTGCT
GCGCAACATC AACGTGAGCT CACCTGCGAG GTTAACCAAC ATATCCACTC TATCGCCAGT
GTGGCAACGC AAAATTCCGC GGCCACCCAT ACCGTAGAAC AGTTAGCGAT GGCGATGAGC
GGTAAAGTGG CAGAATTTGG AGCGCTTTCG AAGCAGTTTG CACAAAAATA G
 
Protein sequence
MYTFALIKIT EEQQAEMSAY TPSAQQEVLV GDHDQLVSTT DLKGVITYCN DTFCRIAGFQ 
ADELLGKNHN IVRHASMPKA AFADMWHHLK QGHAWRGIVK NRTKSGGFYW VDAYVTPIYQ
QGQLTGYQSV RVKAERKWVE IATKAYQALL VAEKAGKKIQ FKLHTSLRYA LLLGALMSPA
LAHGFQAPEQ WQWLASLLPA GVLGLLFRQE LVRTPQQLKQ WQNEYDSISR LIYSGADAFS
VADYHLKMAS ARIRTILGRM MDSARPLGEL ANQLHLTTQE VHQALAAQNS NIQAVTQATD
AVESAAERVS SHTHSAHQLI DQVQNHCAET KHSINVTHQN LQRLATQAES AALTTLKLSD
QAQQVGQLMT EIGGIAEQTN LLALNAAIEA ARAGEQGRGF AVVADEVRAL SARTQRATQQ
IQTSIDTMLS TIEAWRGDIT ASRDQTEQCA QDANTTLQQL QDVECVMSDM LRVIGEVASA
AQHQRELTCE VNQHIHSIAS VATQNSAATH TVEQLAMAMS GKVAEFGALS KQFAQK