Gene VIBHAR_04810 details

Gene Information

Locus tag: VIBHAR_04810
Symbol:
ID: 5557382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio harveyi ATCC BAA-1116
Kingdom: Bacteria
Replicon accession: NC_009784
Strand:
Start bp: 71413
End bp: 72597
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 50%
IMG OID: 640909289
Product: Xaa-His dipeptidase
Protein accession: YP_001446946
Protein GI: 156976040
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein
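
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 71413
END_BP = 72597


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results agree with the Gene Length and Protein Length fields of the record.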


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGAC CGATTGAATT TAAAACCATC TTGCTGGCGT GTCTTATCAT CAGTGTTGGT 
CAACTCAGCA TGGGCTTGGT GTTTCCATCT CTTCCTTGGA TCGCGAAAGA TTTCGATATT
TCCCTCGACC AAGCTCAGCT GTTAGTCAGT GTTTACTTGC TAGGTTTTGG GCCTTCACAG
TTTATCTATG GCCCGGTATC CGATGCATTG GGCCGAAAAA AGGTGCTGTT GGCTGGCTTG
TTGATTGCCA TGCTCGGCCT ATTGATGATC ATCTTCCTAA GCCACACTTT CACTGGCATG
GTGGCGGGAC GTTTTCTGCA AGGTTTAGGA ACTGGCTGTT GCGCGGTGTT AGCCCGAGCG
TCTACTCGCG ACCGCTTCAA TGGCCCTGAG CTCCCTGTCG CTTTGTCCTA CATTGCTATG
GCAGCTTCTA TTACACCGTT AGTTGCTCCT GTTATTGGTG GTTTCATCAA CGCCCACTTC
GGCTGGACCA TGGTGTTTAT CTCGCTGTTG GGTTACGTAT TGTTGGCTTG GACTGTGATC
GTATTTCGCT TCCAAGAGAC CATCACACAA ACCTCAGCCT TGCCATCACC GAAAAAGATG
CTGCTGCAAT ATCGTGACCT TTTGACTTCT CGTTACTTTA TGAGCTTTGC CAGTATTGGT
TGGCTTAACT TCAGCTTGAT GATCACCACT GTTTCGGTGA TGCCTTTCAT CATGCAAAAC
CAAACCGGCA TGACATCCGA TCAATACGCG ATGTGGGCAC TGATTCCGGC GTTCGGCATG
ATCTGCGGCA CCAGTATCTG TAACCGTGTG CGACCAATCA TCGGCACGAA GAAGATGCTA
CTGGTCACGC CAATCCTGCA CGTCAGTTCC GCAGCGTGGC TGTTCTTCTG CCCTGTTGAG
CCGCTGTACT TAATGCTAGG CCAACTGCTG ATGATTTTAG GCAATGCTAT CGCTCTACCT
TGTGCTCAAG CCATGGTAAT GCAACCCTAT AAGAAACAAG CGGGGGCAAC TGCGGCGATG
TCGGGCGGCG GCCAAATGGT GGTGTCATCG ATTGTGAGTA TGGCATTGGT GCAGCTCGGA
TTAAGCCAAG CGTGGCATCT GTCATTAGTA ATCGTGGTCT TCGCGCTCAT TACACTGACC
AATGTTTTGC GAGGCTTCAC CACAGAGCAA CCTTCAGAGC AATAA
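
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalized before its length or GC content can be compared with the record fields. A minimal sketch follows; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGAGTCGAC CGATTGAATT TAAAACCATC TTGCTGGCGT GTCTTATCAT CAGTGTTGGT"
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on a full line
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Passing the full sequence text through the same two functions would give values to compare against the Gene Length and GC content fields.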
 
Protein sequence
MSRPIEFKTI LLACLIISVG QLSMGLVFPS LPWIAKDFDI SLDQAQLLVS VYLLGFGPSQ 
FIYGPVSDAL GRKKVLLAGL LIAMLGLLMI IFLSHTFTGM VAGRFLQGLG TGCCAVLARA
STRDRFNGPE LPVALSYIAM AASITPLVAP VIGGFINAHF GWTMVFISLL GYVLLAWTVI
VFRFQETITQ TSALPSPKKM LLQYRDLLTS RYFMSFASIG WLNFSLMITT VSVMPFIMQN
QTGMTSDQYA MWALIPAFGM ICGTSICNRV RPIIGTKKML LVTPILHVSS AAWLFFCPVE
PLYLMLGQLL MILGNAIALP CAQAMVMQPY KKQAGATAAM SGGGQMVVSS IVSMALVQLG
LSQAWHLSLV IVVFALITLT NVLRGFTTEQ PSEQ