Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_0997 |
Symbol | |
ID | 5133980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 972518 |
End bp | 973594 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640531319 |
Product | AraC/XylS family transcriptional regulator |
Protein accession | YP_001215833 |
Protein GI | 147672282 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000000441723 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAACGA TTCGCATATA TGTTAAATTG AGCTCAACCA ATAGAGAAGT TGAATTGATT ATGCAAAATC GCACATCAGT TATGGAAAAC ACGCAAAATC AACATAAGGT ATTAACCAAA AACGGCGCAC GCCAAACGGT CACCTTGTCA CGTCATGTGG TCAAGGAGCG AGATGAGCAA ATTGTTGCGC AAAATTTGAA TCAACCTGTG ATGGCACAAG GCCACTTTGT GGAATACGTC AGTCCAAACG GGTTTACCTT GCATGGAGGG TCTAGCCTTG AATTGGCCGA TTGCGATGTG ATGACCACCA GCGCGCCAGC CCTTGTCATC ATTCTCTTGC TGGAAGGCAC CTTACGCTTT GCCTATGATG ATCTCAAACT TGAATTGTGT GCCAATCAGC ATCCACAAGC TTTAATGGTC AATTTGGAGC AGCCGTGCAT CTTCCATCGC CGTCTGCATC AAGGGATGAC AGTACGCAAA CTCAACATCG TTTTATCGCC CGATACATTG CAAAAATTCG CGCAACACTC CTGCCCATTG CAACATTTTC TGCAACAAGA CAAAGCACTG GTTCCACTTT CACTGAACGA GGAAAGTTGG CAAGCCGTTG AGTCTCTGCT CAACCGTCGT GTGATCCACA CCATCAGTGC GCATATTGCT CGGGAAGCCA CGGTTTGGCG CTTGGTGCAT GACGCCGTGT TGCAATGCCC TAAACACGCC ACCTCGTTAT CTCAGCACCA AGAGGGACAA TCTGAGCAAT GGATTAATCA GTTGCTGCAC TATATCGATC AGCATTTGCA TGAAGAGATT TGCTTAGAGC AACTGGCAGA ACGCCATGCG ATGAGCGTTT CCAACTTACA ACGCAAATTT AAAACTCGCC TGAATATGAC GATTGCACAC TACATCCGGC ACCGACGGTT ACAACTGGCG CGTCAGCAAT TAGAGCGAGG TTTGGTCACC ATCACAGAGG CCGCGTATGA AGCAGGCTAC CTGCACCCTT CCAACTTTAC CGCAGCGTTT AAAAAAGCCT TTGGTATCTC TCCGCAGGCT TTTGTGGAAT TAAAACAGGC GGGTTAA
|
Protein sequence | MITIRIYVKL SSTNREVELI MQNRTSVMEN TQNQHKVLTK NGARQTVTLS RHVVKERDEQ IVAQNLNQPV MAQGHFVEYV SPNGFTLHGG SSLELADCDV MTTSAPALVI ILLLEGTLRF AYDDLKLELC ANQHPQALMV NLEQPCIFHR RLHQGMTVRK LNIVLSPDTL QKFAQHSCPL QHFLQQDKAL VPLSLNEESW QAVESLLNRR VIHTISAHIA REATVWRLVH DAVLQCPKHA TSLSQHQEGQ SEQWINQLLH YIDQHLHEEI CLEQLAERHA MSVSNLQRKF KTRLNMTIAH YIRHRRLQLA RQQLERGLVT ITEAAYEAGY LHPSNFTAAF KKAFGISPQA FVELKQAG
|
| |