Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VIBHAR_04810 |
Symbol | |
ID | 5557382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio harveyi ATCC BAA-1116 |
Kingdom | Bacteria |
Replicon accession | NC_009784 |
Strand | - |
Start bp | 71413 |
End bp | 72597 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640909289 |
Product | Xaa-His dipeptidase |
Protein accession | YP_001446946 |
Protein GI | 156976040 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGAC CGATTGAATT TAAAACCATC TTGCTGGCGT GTCTTATCAT CAGTGTTGGT CAACTCAGCA TGGGCTTGGT GTTTCCATCT CTTCCTTGGA TCGCGAAAGA TTTCGATATT TCCCTCGACC AAGCTCAGCT GTTAGTCAGT GTTTACTTGC TAGGTTTTGG GCCTTCACAG TTTATCTATG GCCCGGTATC CGATGCATTG GGCCGAAAAA AGGTGCTGTT GGCTGGCTTG TTGATTGCCA TGCTCGGCCT ATTGATGATC ATCTTCCTAA GCCACACTTT CACTGGCATG GTGGCGGGAC GTTTTCTGCA AGGTTTAGGA ACTGGCTGTT GCGCGGTGTT AGCCCGAGCG TCTACTCGCG ACCGCTTCAA TGGCCCTGAG CTCCCTGTCG CTTTGTCCTA CATTGCTATG GCAGCTTCTA TTACACCGTT AGTTGCTCCT GTTATTGGTG GTTTCATCAA CGCCCACTTC GGCTGGACCA TGGTGTTTAT CTCGCTGTTG GGTTACGTAT TGTTGGCTTG GACTGTGATC GTATTTCGCT TCCAAGAGAC CATCACACAA ACCTCAGCCT TGCCATCACC GAAAAAGATG CTGCTGCAAT ATCGTGACCT TTTGACTTCT CGTTACTTTA TGAGCTTTGC CAGTATTGGT TGGCTTAACT TCAGCTTGAT GATCACCACT GTTTCGGTGA TGCCTTTCAT CATGCAAAAC CAAACCGGCA TGACATCCGA TCAATACGCG ATGTGGGCAC TGATTCCGGC GTTCGGCATG ATCTGCGGCA CCAGTATCTG TAACCGTGTG CGACCAATCA TCGGCACGAA GAAGATGCTA CTGGTCACGC CAATCCTGCA CGTCAGTTCC GCAGCGTGGC TGTTCTTCTG CCCTGTTGAG CCGCTGTACT TAATGCTAGG CCAACTGCTG ATGATTTTAG GCAATGCTAT CGCTCTACCT TGTGCTCAAG CCATGGTAAT GCAACCCTAT AAGAAACAAG CGGGGGCAAC TGCGGCGATG TCGGGCGGCG GCCAAATGGT GGTGTCATCG ATTGTGAGTA TGGCATTGGT GCAGCTCGGA TTAAGCCAAG CGTGGCATCT GTCATTAGTA ATCGTGGTCT TCGCGCTCAT TACACTGACC AATGTTTTGC GAGGCTTCAC CACAGAGCAA CCTTCAGAGC AATAA
|
Protein sequence | MSRPIEFKTI LLACLIISVG QLSMGLVFPS LPWIAKDFDI SLDQAQLLVS VYLLGFGPSQ FIYGPVSDAL GRKKVLLAGL LIAMLGLLMI IFLSHTFTGM VAGRFLQGLG TGCCAVLARA STRDRFNGPE LPVALSYIAM AASITPLVAP VIGGFINAHF GWTMVFISLL GYVLLAWTVI VFRFQETITQ TSALPSPKKM LLQYRDLLTS RYFMSFASIG WLNFSLMITT VSVMPFIMQN QTGMTSDQYA MWALIPAFGM ICGTSICNRV RPIIGTKKML LVTPILHVSS AAWLFFCPVE PLYLMLGQLL MILGNAIALP CAQAMVMQPY KKQAGATAAM SGGGQMVVSS IVSMALVQLG LSQAWHLSLV IVVFALITLT NVLRGFTTEQ PSEQ
|
| |