Gene Information |
Locus tag | VIBHAR_02194 |
Symbol | |
ID | 5555466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio harveyi ATCC BAA-1116 |
Kingdom | Bacteria |
Replicon accession | NC_009783 |
Strand | + |
Start bp | 2191219 |
End bp | 2194176 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640907681 |
Product | hypothetical protein |
Protein accession | YP_001445384 |
Protein GI | 156974477 |
COG category | [R] General function prediction only |
COG ID | [COG3979] Uncharacterized protein containing chitin-binding domain type 3 |
TIGRFAM ID | |
|
|
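The coordinate and length fields in this record are related by simple arithmetic, which the following sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2191219, 2194176)
print(length_bp)                  # 2958
print(protein_length(length_bp))  # 985
```

Both results agree with the Gene Length and Protein Length fields listed above.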
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA ATAAAGTCAG TATGGGACTT AATATAGTTG GTGGAATATT GCTTTCAGGA ATAGCAAACG CTCAAGTCAT TCAATTTAAT AATGAAACGC CAGTAAATAA TTTACAGGGT GATTTTGAGG CGATGGTGAT TTATGCGCAA AACATTACTG TCCCTCATTA CAATGATAAA GGCGACCCTA GGCCGCATCT CACCGCATTG CGTGATACTT TACTTCTTGT TAAGCCAGTT GAACCTTTAA ATGTCTCATC ACAAGTCAAA GTTATTGCAC GAGATAGTCA AGGCAATCAG TTGGGTGCTT TATCACTAAA CACACCAGAG CAACTTCCCA ATCATGACGG CCCCAACCTC GACATTATGT ACGCGAAAGA CATGTGGTCT GTAACGCTCC CAGCGACTTG GATACAACCC GGTCTGACTT TGGAGTTGCT CAATGGGAGT ACATCTGGTC TTCTGTCCGA TTTGGATATT GGCGCGCCTA ACGAACTGCT CATTAATACG ATGGATTTGG GGATGCTGAC TAATCCAAGG GGACGTTTCT ACTTCGCTGA TAATCCAGAG TTGCACAAGG ATTACTTCCA GAAAATTCCG GTAAGTAAGT TGATTGTTAA CCCTTACGAA ACCGTGAAAC TTGATGAAGT GATGCTACCG GATGGAGATT TACTTACAGA GCTTGACCCG AGCACTGGGA CTTGGCACCA AGGAGATATG CGAGCGGATA CCACGAAGAT CTTGATGTCC CATGGTATTA ATCTCGCAAA CTATGGGATC AATTCATCAA CGTCAGTGTC AGAGAGAGCA CACCCGTACA CGGCGAACCA AATTACCGCG ATTGCAGCAG TAGGTCGTTA TCAAAATGGT GTTGTTGCTC ACGGTGGTAG TGGAGGCAAC GGCATGGTGA CTATCGACTC ATCAGTGGGA AATGAATGGA GTCACGAAGT TGGCCATAAC TTTGGGTTGG GTCATTGGCC AGGAGGCACT GATGGGACGA CACACCGCCC ATCAACGGAC ATCAACTCTG CTTGGGGCTG GGATCTATTT CAGAAGCGTT TCATTGCCAA CTTCATGTGG AACAAGCGTA ATGGCGAAGA CCAAGTTTGC TGTAGCGATG GGATTGGCAT TCCTGCATTT GAAGGCTACA AGTTTAATCG AGATGCGATG GGGGGTGGTG AGCCAACTTC ACCAATTTCC AAATACACGC TGCATACTCC TTATGTGCTA GAGAAAATTC AGACTTTTAT GGAGGGCAAA GCTACATTTG ATGAATCATC TCCTACTGGG TTTACTAAGT GGAATGATGA AACGAAAGCG ATGCAAGCGT TCCAGCAACC CGCTATTTTG TTATCGAAAT CGATTACCTC TCAATCACAA TTGAATACGA TCAAAGACGA CATCGATGGC TCCGTATTGT TGGGCTACAT CAACGATTTT GACATCACGA AAGTCGAGAC GGGAGATGGA CGTTGGATAC GCGATATTTA TTTGCCTAGT GCAGAAAATG TTGCAGTAGG CAAAGCTGTG AACATTGCGC GTTACTCTGG CTATGGAGTG ACGGCTCATG TTAATGGTCA GTTGGTTAAC TTGAATCGAG GTGATAGTAA GTTTTACATC AGTGATGGAA AAGCATGGCT AGAAACCACT GAAGATCAGG TAGCCGAGGG TAAACCAACA CGTATTCCAA CTGACTATGG TGTACCAGTA ACGACGCTAG TAGGTTATTA CGATCCTCAG CAACAATTAG ACAGCTACAT TTTCCCTGCT TTGCACGGCG CTTATGGGTT TGTTTATCAA 
CCTACTCCTG TTGATCGCCT CGATACGACA GGTTGTTATG TCAAAGTGTA TAACGGTCAA GACCACCAAA CGGACAATTA TCAGTTGGTT GGATTCCGTT TCGACGATAA TGTGATGAAC AAATTCCATA TCAACCTTAA GCAGACCGAT GATCCAACAA GGGCAGAGGT TGTGTGTGAT AACGCTGTTT TAAGCTCGTT AGATATAGAG AAGCCAAAGC AAGCGTTAGA GGTTTCTATC GTCCAGTCAG ATTCTTTAAA ACCATCCGAG AACAATAAGC CCGTAGCGCA CGCAGGTGAA GACCAGTCAG TATTGTCAGG AGCAACTATC ACCTTATCGG CCGACAAATC TACTGATGCC GATGGTGACA AGTTAACCTA TGTTTGGGAG CAGATATCGG GTTTGCCCGC GAGCATTCAA GCAAGCAACG CAATGACAAC CAATGTGGTC TTGGCAGAAT CGACACAAGA GCAATCTTAT GTGTTTTCTG TGTTGGTTTC AGACGGTAAA TCGAGCTCCA GTGATATGGT CACCATTATT GCTCAGCCCC AACCAACTCA AAACCATGCA CCGGAAGTCT CTCTTCCAAG CAGTATTGAT GCAAAACCCG GAGAGGTTAT AGAGATAACC GCGACGGCAT CGGATCCTGA TGGTGATGTG CTGTCATTTA AGTGGAGCAC TTCAGGTTTA GCTTATCAGA CTATGTCCGG TGGCACGATT CAACTTTCAG TTCCAGACGT GACGGTTGAT AGTCAATTTT TGCTAAGTGT TGTTGTTTCC GATCCTAAAG GGGAGAGTGC AAGCGCAAAT ACAACGCTTA ACGTGAAAGC CAGCGGCAAT ACCTGCAGCG TCAGTGACCC AAATGCCACG AATTACGACG CGTGGTCTGC GAGCAAATCC TATTCAGGTG GTGACTTAGT CAGTTATAAG CAGTTGGTTT GGAAGGCAAA GCACTGGAGT CAAAATAATC AACCTGACAG TAGTGATGCT TGGGAGCTCC TAAGTGATGT TGTGCTGCCA TGGTCAAGTC AAGCAGCATA CTCCGGAGGT GCTCAGGTTA CCTATAACGG GGTTAAATTT GAGGCCAAGT GGTGGACTCG TGGCGAGCAG CCTAATGTGT CCAGTGTCTG GATCAACAAG GGCGCTGCCT GCCAGTAG
|
Protein sequence | MKRNKVSMGL NIVGGILLSG IANAQVIQFN NETPVNNLQG DFEAMVIYAQ NITVPHYNDK GDPRPHLTAL RDTLLLVKPV EPLNVSSQVK VIARDSQGNQ LGALSLNTPE QLPNHDGPNL DIMYAKDMWS VTLPATWIQP GLTLELLNGS TSGLLSDLDI GAPNELLINT MDLGMLTNPR GRFYFADNPE LHKDYFQKIP VSKLIVNPYE TVKLDEVMLP DGDLLTELDP STGTWHQGDM RADTTKILMS HGINLANYGI NSSTSVSERA HPYTANQITA IAAVGRYQNG VVAHGGSGGN GMVTIDSSVG NEWSHEVGHN FGLGHWPGGT DGTTHRPSTD INSAWGWDLF QKRFIANFMW NKRNGEDQVC CSDGIGIPAF EGYKFNRDAM GGGEPTSPIS KYTLHTPYVL EKIQTFMEGK ATFDESSPTG FTKWNDETKA MQAFQQPAIL LSKSITSQSQ LNTIKDDIDG SVLLGYINDF DITKVETGDG RWIRDIYLPS AENVAVGKAV NIARYSGYGV TAHVNGQLVN LNRGDSKFYI SDGKAWLETT EDQVAEGKPT RIPTDYGVPV TTLVGYYDPQ QQLDSYIFPA LHGAYGFVYQ PTPVDRLDTT GCYVKVYNGQ DHQTDNYQLV GFRFDDNVMN KFHINLKQTD DPTRAEVVCD NAVLSSLDIE KPKQALEVSI VQSDSLKPSE NNKPVAHAGE DQSVLSGATI TLSADKSTDA DGDKLTYVWE QISGLPASIQ ASNAMTTNVV LAESTQEQSY VFSVLVSDGK SSSSDMVTII AQPQPTQNHA PEVSLPSSID AKPGEVIEIT ATASDPDGDV LSFKWSTSGL AYQTMSGGTI QLSVPDVTVD SQFLLSVVVS DPKGESASAN TTLNVKASGN TCSVSDPNAT NYDAWSASKS YSGGDLVSYK QLVWKAKHWS QNNQPDSSDA WELLSDVVLP WSSQAAYSGG AQVTYNGVKF EAKWWTRGEQ PNVSSVWINK GAACQ
|
| |
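The GC content field can be recomputed from the gene sequence once the spaces that group bases into blocks of ten are removed. The sketch below is a minimal illustration using a hypothetical helper, `gc_content`, shown only on the first two blocks of the sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Percent G+C in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, as formatted in the record.
fragment = "ATGAAAAGAA ATAAAGTCAG"
print(gc_content(fragment))  # 25.0
```

Passing the full gene sequence to the same function would be expected to give a value near the reported 46%, although that figure is not recomputed here.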