Gene VIBHAR_01041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_01041 
Symbol 
ID5556143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp1049896 
End bp1051530 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content48% 
IMG OID640906535 
Productmalate synthase 
Protein accessionYP_001444261 
Protein GI156973354 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTATGC TTGCTCAGAC AGAACAAAAA ACACAACAAC TAAAGCAAAC CCAAGGCATG 
CTTGAGGTGA ATGGAGTCGT TGCTCCTGAA CATCAAGCAA TTTTCCCTGT TGAAGCCCAA
ACCTTTTTAT CTCTACTGTG TGAAAAATTT GCCGAACGTG TTGAACAGTT GCTGGAAGCA
CGAGAAGAGA AGCAAGCACG CATCGACGCT GGTGAACTGC CAGACTTTCT ACCAGAGACA
CAAGACATTC GTGAAGGAAG CTGGAAGATC CTTGGAATCC CGCAAGATCT GCAAGATCGC
CGAGTGGAAA TCACTGGACC AACCGATCGC AAGATGGTGA TTAACGCACT GAATGCGAAT
GTAAAAGTGT TCATGGCCGA TTTCGAAGAT TCGATGTCCC CTGCGTGGAG TAAAGTTCTG
GATGGTCAAA TCAACCTGCG CGACGCTGTT AATGGCACCA TCAGCTACAG CAATCCTGGC
AATGGCAAGC ACTATCAGCT GGCAGAAGAC CCAGCGGTGT TGATCTGCCG TGTTCGCGGA
CTGCATCTAA AAGAAAAACA CGTAACGTGG CACGGTCAGA TCATTCCAGG TGCGCTATTC
GATTTCGCTC TGTACTTCTA CAACAACTAC AAAGCGCTAC TGAAAAAGGG AAGCGGTCCT
TACTTCTACA TTCCAAAACT GCAATCGCAT CATGAAGCTA AGTGGTGGAG CGAAGTGCTC
CATTTCACCG AAGAATATTT CGGTTTGGAT ACTGGCACCA TCAAAGCGAC TGTACTGATT
GAAACCTTAC CAGCCGTATT CGAAATGGAT GAGATTCTGT TCTCTCTGAA AGAGCACATC
GTTGGTTTGA ACTGTGGTCG CTGGGATTAC ATCTTTAGCT ACATCAAAAC ACTGAAAAAC
CACCCAGATC GCGTACTTCC GGATCGCCAA GTGGTGACCA TGGATAAGCC ATTCCTCAAC
GCTTACTCAA GATTGTTGGT GCGAACTTGT CATAAACGTG ACGCATTTGC GATGGGCGGC
ATGGCAGCCT TTATTCCGGC TAAAGACCCA CAAGAAAACC AAAAAGTGCT GGATAAGATC
CACAACGATA AATCACTAGA AGCCAACAAC GGTCATGACG GCACTTGGGT TGCTCACCCT
GGTTTGGCAG ACACCGCAAT GGAAGTGTTC AGTGCCACAC TGGGGCAGCG CACTAACCAA
TTGGATGTGA GCCGCTCAGA AGACGCACCA ATCACCGCCG CAGAGCTGCT TGAACCTTAC
GAGGGTGAAC GCTCAGAAGA AGGTATGCGC CACAACATCC GCGTTGCGCT GCAATACATC
GAAGCGTGGA TCTCTGGCAA TGGTTGTGTG CCGATTTACG GGCTGATGGA AGACGCTGCA
ACGGCAGAAA TCTCCCGTGC TTCTATTTGG CAATGGATTC AACACGGTAA GTCGCTCGAC
AACGGATTGA AGGTCACTAA AGAGCTGTTC GAACTCTATC TGAAAGAAGA GATTGAAGTC
GTGGAGCAAG AAATTGGCGA GCAGCGTTAC CAAGCAGGTC GATTTGAAGA AGCGGCAGAT
TTGATGGCTA GGCTCATCAC AAGCGATGAA CTGACCAACT TTTTAACCAT TCCAGGTTAC
GACTACTTGG ATTAA
 
Protein sequence
MIMLAQTEQK TQQLKQTQGM LEVNGVVAPE HQAIFPVEAQ TFLSLLCEKF AERVEQLLEA 
REEKQARIDA GELPDFLPET QDIREGSWKI LGIPQDLQDR RVEITGPTDR KMVINALNAN
VKVFMADFED SMSPAWSKVL DGQINLRDAV NGTISYSNPG NGKHYQLAED PAVLICRVRG
LHLKEKHVTW HGQIIPGALF DFALYFYNNY KALLKKGSGP YFYIPKLQSH HEAKWWSEVL
HFTEEYFGLD TGTIKATVLI ETLPAVFEMD EILFSLKEHI VGLNCGRWDY IFSYIKTLKN
HPDRVLPDRQ VVTMDKPFLN AYSRLLVRTC HKRDAFAMGG MAAFIPAKDP QENQKVLDKI
HNDKSLEANN GHDGTWVAHP GLADTAMEVF SATLGQRTNQ LDVSRSEDAP ITAAELLEPY
EGERSEEGMR HNIRVALQYI EAWISGNGCV PIYGLMEDAA TAEISRASIW QWIQHGKSLD
NGLKVTKELF ELYLKEEIEV VEQEIGEQRY QAGRFEEAAD LMARLITSDE LTNFLTIPGY
DYLD