Gene VC0395_0997 details

Gene Information

Locus tag: VC0395_0997
Symbol:
ID: 5133980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 972518
End bp: 973594
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 47%
IMG OID: 640531319
Product: AraC/XylS family transcriptional regulator
Protein accession: YP_001215833
Protein GI: 147672282
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
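
The coordinates and lengths listed above can be cross-checked with a short calculation. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 972518
end_bp = 973594

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1077 358"
```

Under these assumptions the result agrees with the reported gene length (1077 bp) and protein length (358 aa).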


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000000441723
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAACGA TTCGCATATA TGTTAAATTG AGCTCAACCA ATAGAGAAGT TGAATTGATT 
ATGCAAAATC GCACATCAGT TATGGAAAAC ACGCAAAATC AACATAAGGT ATTAACCAAA
AACGGCGCAC GCCAAACGGT CACCTTGTCA CGTCATGTGG TCAAGGAGCG AGATGAGCAA
ATTGTTGCGC AAAATTTGAA TCAACCTGTG ATGGCACAAG GCCACTTTGT GGAATACGTC
AGTCCAAACG GGTTTACCTT GCATGGAGGG TCTAGCCTTG AATTGGCCGA TTGCGATGTG
ATGACCACCA GCGCGCCAGC CCTTGTCATC ATTCTCTTGC TGGAAGGCAC CTTACGCTTT
GCCTATGATG ATCTCAAACT TGAATTGTGT GCCAATCAGC ATCCACAAGC TTTAATGGTC
AATTTGGAGC AGCCGTGCAT CTTCCATCGC CGTCTGCATC AAGGGATGAC AGTACGCAAA
CTCAACATCG TTTTATCGCC CGATACATTG CAAAAATTCG CGCAACACTC CTGCCCATTG
CAACATTTTC TGCAACAAGA CAAAGCACTG GTTCCACTTT CACTGAACGA GGAAAGTTGG
CAAGCCGTTG AGTCTCTGCT CAACCGTCGT GTGATCCACA CCATCAGTGC GCATATTGCT
CGGGAAGCCA CGGTTTGGCG CTTGGTGCAT GACGCCGTGT TGCAATGCCC TAAACACGCC
ACCTCGTTAT CTCAGCACCA AGAGGGACAA TCTGAGCAAT GGATTAATCA GTTGCTGCAC
TATATCGATC AGCATTTGCA TGAAGAGATT TGCTTAGAGC AACTGGCAGA ACGCCATGCG
ATGAGCGTTT CCAACTTACA ACGCAAATTT AAAACTCGCC TGAATATGAC GATTGCACAC
TACATCCGGC ACCGACGGTT ACAACTGGCG CGTCAGCAAT TAGAGCGAGG TTTGGTCACC
ATCACAGAGG CCGCGTATGA AGCAGGCTAC CTGCACCCTT CCAACTTTAC CGCAGCGTTT
AAAAAAGCCT TTGGTATCTC TCCGCAGGCT TTTGTGGAAT TAAAACAGGC GGGTTAA
 
Protein sequence
MITIRIYVKL SSTNREVELI MQNRTSVMEN TQNQHKVLTK NGARQTVTLS RHVVKERDEQ 
IVAQNLNQPV MAQGHFVEYV SPNGFTLHGG SSLELADCDV MTTSAPALVI ILLLEGTLRF
AYDDLKLELC ANQHPQALMV NLEQPCIFHR RLHQGMTVRK LNIVLSPDTL QKFAQHSCPL
QHFLQQDKAL VPLSLNEESW QAVESLLNRR VIHTISAHIA REATVWRLVH DAVLQCPKHA
TSLSQHQEGQ SEQWINQLLH YIDQHLHEEI CLEQLAERHA MSVSNLQRKF KTRLNMTIAH
YIRHRRLQLA RQQLERGLVT ITEAAYEAGY LHPSNFTAAF KKAFGISPQA FVELKQAG
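
The relationship between the gene sequence and the protein sequence above can be illustrated by translating the first and last lines of the gene sequence. This is a minimal sketch in Python, not the database's own method. The helper names `clean`, `translate`, and `gc_content` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for non-start codons. In table 11 an alternative start codon would be read as M at the first position, but this gene starts with ATG, so no special handling is needed here.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid
# assignments (shared by translation table 11 for non-start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGATAACGA TTCGCATATA TGTTAAATTG AGCTCAACCA ATAGAGAAGT TGAATTGATT"
last_line = "AAAAAAGCCT TTGGTATCTC TCCGCAGGCT TTTGTGGAAT TAAAACAGGC GGGTTAA"

print(translate(first_line))  # MITIRIYVKLSSTNREVELI
print(translate(last_line))   # KKAFGISPQAFVELKQAG
```

The two outputs match the first 20 and last 18 residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the reported 47% GC content.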