Gene VC0395_A2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2067 
SymbolleuC 
ID5135145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2226162 
End bp2227565 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content51% 
IMG OID640533524 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001217991 
Protein GI147674703 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAG CCAAAACTCT TTACGAAAAA ATTTACGATG CCCATGTGGT GGTGGCCGCA 
CCGGGTGAAA CGCCGATTCT GTACATCGAT CGTCATCTAG TCCATGAAGT GACCTCTCCA
CAAGCCTTTG ATGGCTTGCG TGAAAAAGGT CGCCCGGTAC GCCAAGTTAG CAAAACTTTT
GCGACCATGG ATCACAACGT CTCCACCACT ACTAAAGACA TCAATGCCTC AGGTGAAATG
GCGCGCATCC AGATGCAAAC GCTTTCCAAA AACTGCGAAG AGTTTGGCGT CACGCTGTAC
GACATTAACC ACAAGTACCA AGGCATTGTG CATGTGATGG GACCTGAGCT TGGTATTACC
CTCCCCGGCA TGACGATTGT ATGTGGTGAT TCACACACCG CAACTCACGG TGCATTCGGC
TCGCTTGCCT TTGGGATAGG CACCTCAGAA GTAGAACACG TACTGGCTAC CCAAACCCTA
AAGCAAGGCC GCGCTAAGAC GATGAAAATC GAAGTGCGGG GCAAAGTGGC TCCCGGCATC
ACCGCTAAAG ACATCGTACT GGCGATCATT GGTAAAACAA CTGCCGCTGG CGGTACAGGC
TATGTCGTGG AATTTTGTGG AGAAGCGATT CGCGATCTCT CCATGGAAGG TCGCATGACC
GTGTGTAACA TGGCGATTGA ACTTGGTGCG AAAGCAGGCT TGATTGCCCC TGATGCAACT
ACATTCAACT ACATCAAAGG CCGCAAGTTT GCCCCACAAG GCAGTGATTG GGATGCTGCT
GTCGACTATT GGCAAACATT AAAAACCGAT GAGGATGCAC AGTTCGATGC CGTCGTGACG
CTTGAAGCCA GTGAAATCAA ACCGCAAGTG ACTTGGGGTA CCAACCCAGG CCAAGTGATT
GCCGTTGATG AGCCAATCCC ATCACCTAGC CAGTTTGCCG ATCCTGTTGA ACGCAGCTCA
GCAGAAAAAG CTCTGGCTTA CATGGGACTT GAAGCCGGCA AAATGCTCTC CGATTATAAG
GTCGATAAAG TGTTCGTCGG TTCATGCACC AACTCGCGCA TCGAAGATAT GCGCGCTGCG
GCAGCGGTAG CCAAAGGGAA AAAAGTCGCC TCTCATGTCC AAGCATTGAT CGTGCCCGGA
TCCGAACAAG TGAAAGCGCA AGCCGAAGCG GAAGGCTTAG ATAAGATCTT TATTGAAGCA
GGATTTGAAT GGCGCTTACC GGGTTGCTCA ATGTGTTTAG CCATGAACAA TGACCGCTTA
GGGCCAGGAG AACGCTGCGC GTCTACCTCA AACCGCAACT TTGAGGGACG TCAGGGACGT
GATGGTCGCA CTCATTTAGT TAGCCCAGCA ATGGCAGCCG CTGCAGCGAT TGCTGGTCAC
TTCGTCGATA TTCGTCAGTT TTAA
 
Protein sequence
MSKAKTLYEK IYDAHVVVAA PGETPILYID RHLVHEVTSP QAFDGLREKG RPVRQVSKTF 
ATMDHNVSTT TKDINASGEM ARIQMQTLSK NCEEFGVTLY DINHKYQGIV HVMGPELGIT
LPGMTIVCGD SHTATHGAFG SLAFGIGTSE VEHVLATQTL KQGRAKTMKI EVRGKVAPGI
TAKDIVLAII GKTTAAGGTG YVVEFCGEAI RDLSMEGRMT VCNMAIELGA KAGLIAPDAT
TFNYIKGRKF APQGSDWDAA VDYWQTLKTD EDAQFDAVVT LEASEIKPQV TWGTNPGQVI
AVDEPIPSPS QFADPVERSS AEKALAYMGL EAGKMLSDYK VDKVFVGSCT NSRIEDMRAA
AAVAKGKKVA SHVQALIVPG SEQVKAQAEA EGLDKIFIEA GFEWRLPGCS MCLAMNNDRL
GPGERCASTS NRNFEGRQGR DGRTHLVSPA MAAAAAIAGH FVDIRQF