Gene Information |
Locus tag | VIBHAR_01999 |
Symbol | |
ID | 5556374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio harveyi ATCC BAA-1116 |
Kingdom | Bacteria |
Replicon accession | NC_009783 |
Strand | + |
Start bp | 1990862 |
End bp | 1993855 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640907486 |
Product | hypothetical protein |
Protein accession | YP_001445191 |
Protein GI | 156974284 |
COG category | [S] Function unknown |
COG ID | [COG5283] Phage-related tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
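The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP = 1990862
END_BP = 1993855

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (2994 bp) and Protein Length (997 aa) fields listed in the record.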
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATC AATCTAAGTA TCAAGTTGCC ATTGCAGCGA TTGACCAATA CTCCGCACCT CTCAAGGCGG CAAGCGCATC GTTTGACTCA CTGACTGAGG ATGTTAAAGC TCAGTCGGCA GAGATCAAAA AGCTCAATGC CAGTGCCGCC AGCCTTAACT CTTACGCGAG CATTAAGCGT GATTTAACTG AAACGTCAGC TCAAATGGGC TCGGCTAAAG TGAAAGCGGA GCAGTTGGCT GTAGCCACAA AGGAGCTTAG TCTCCAAACG AAAGGCTACG AGTCAGCAGT TTCTAGTTCT CAAAGTAAGC TTCAAAACCT TGAAGCTCAA ATGAGGGCAA CGGAACACCC GAGTAAAGCT CTGCGAACAG CCATTAAGGA AGCTCGCAAA GAAGTTAAAC TCAATACAGC CAATCTTAAC GAACATCAGC TGCAGCTGAG TAAAACACAG GCTGCTTATG ATAAATCACG TCATGAAGTT TCCCAGCTCA CCAAGCACTA CGATGTTCAG AGAACGAAGC TTCATGGTCT GAAATCCAAG TTAAATGAGT CTGGTATTCA GGCGCATAAG TTTGGTGAAG CACAACGTCA GATCAAGCGT GATATTACGG CTGCGAACGC TGCACTGGAT AAGCAGAAAG CCAAGCTGAA ATCGGTGCAG GCTGCATCTG CAAAGATTGA GTCGAATAAG ATGGCTCGCT CTGAATTAGG GGGTGAGGCG TTAGATTTGG CAATGAAAGG TGCTGTTCTG GCAGTGCCGA TTAAGTTTGC TGTCGATTTC GAATCAGCGT TTGCTGATGT GAAAAAAGCG GTGAATGATG CCAGTGATGA AGAGCTGGAG GTGATGAAGA AAAGGATCGT TATTGAGGCG CCTAAATTGG GTGTTACTCA AGAAGGTCTT TCTGCGATTA TTGCCGAGGG TGCTCGTAAC GGCATCGCAA AAGATGAGTT GTTTAACTTT GCTGAGTCAG CAGCAAAGAT GTCTGTTGCC TTTGATATGG CCGCCGATGA AGCTGGCGCG TCAATGATGA AATGGCGCAC GACGATGAAT CTTAGTCAGG ATCAAGCGGT TAATCTGGCG AATGCGGTGA ACTATGTTGG TGACAACATG GCAACCACGG CAAAAGACAT CACCGAAGTG CTGGTTCGTC AGGGTGCAGT GATCACCAAT GCAGGTCTTG ATGAGGTTCA AGCGGCCTCA CTATCAGCAG CAGTGTTGTC CGGTTCAGCA AGTACAGAAA TCGCGGCTAC CGCGACAAAG AATTTGTTGC TGAGTTTAAC GGCTGGCGAT TCAGCGAGTG GTGGTCAGAA AGATGCGTTA ATGACCTTGG GATTTGACCC AGCCGATTTA GCGCGAGACA TGCAGGAAAA CGCACCTAAG ACGGTAGAGC AGGTGCTCTT GGCAATTAAA GACCAAGATG CCGACGTTCA AACCGCCTTG ATGAAGAACC TATTCGGTTC TGAGTCTATC GGTTCAATCG CGCCACTTTT GCAAAACCTT GATAACTTCC GTAAAGCATT CAAGTTAGTA GAGAAAGATA CCAACTTTGC AGGCTCTATG CAGAACGAGT TTGAGGTTCA GTCTGCTACT GCCATGCGGA AAATATCAGC CTTTACTGCT TCCATAACCG GACTGTTCAC CGTTTTAGGT GAAAGCATGC TGCCTGTTGT AGGTGATGTG CTTGATACCA TAACGCCAGC AGTCACATGG TTAACAGAAG CAGCACAATC GGCACCAGGT GCAACTGCTG CATTAATGGC TATTCCTGCT GCCTTGGTGG CTGTAAAAGG TGCAGCATTA 
GCTTTTAAAG CAGGTAAGCT GTTACTTGGT CAGGGTAAGA ACTATGCCGA TTTAGGTAAA GCTAAGCTTG GCATTGGTTT GGATGGTACT GCGGAATCGG CTCAGAAAGC GACGTCACGC CTTTCTCGAT TGAATCAGAC ATTGGATAAC CTTGGTAACA ATGGGGGCCG TGGTCGCAAT CGTTATCAAC GTGATGGTGC ATCTACTTCC AGTCGTCGTA AATCTCGTCC TAAAAGAGCA AGATTATCGC GTCGTGGTGG TAAGTTTGGG CGTCTGCTTG ATTTTGGTAG TCGATTAACT GACTTCCTTC CTATGGGTGC GCCAGCACCT GCCTTTGCGG CATCCTCTAA TGGGTCTGCA AAAGGGAAAT GGGGCAAGCG AGCTGCTGTT CTTGGTGGTG GTACTGCGTT ATCAATGCTT ACCTCAAGTG CTAATGCGGC TGATTTGGCT CTGATGGGAG CTGATGCGGC ATCTGTGGCA TCTGTGGCAG GTGATGTGGC AGGTTCTTTG CCGTTAAAAG GCATGATGGC GGGTATTGCA GGCACAGCAG GAAAACTATT CAAACCACTT AACGTCATGC TGCAGGGCGC GGCATTAACT TCTGCGATCA ATAACGGTAG CGCTGAAGAA ATCGGAGGTA CTGCAGGTGA TATGGCAGGC GGTTTGGGTG GTGCGGCATT AGGTGCAACG ATTGGTACCG CGATTTTACC TGGTATAGGT ACGGTCGTAG GTGGTGCTCT TGGTGGTTTA GCCGGTGGCG AACTTGGCGA GTGGCTCGGG ACGAAAGTTG CTGGTCTGTT TTCTAGTGAT GAAGAGGAAA CTCTGACTGC TAAAGCGGCA ACGCTGGACA AAGAAAAAGA CGGATTAACT CAGCCTACTA GCCAGTTGCA AGCTGATGCT GTGCTGAATG GTCCTGATGG CGGTTCAAGT TTGGCAATGT TACCCGAACC GACGAGTGTG CTTCCGTCAC CGGATAGTAT TGCGAAGTCG CTTTCTCAGA CAAATCAAGA TAACCGCAAG GTCGAAATTA ACTTAACTAT CCCACCATCC TCAAGCAATC CTCAACAGGA CGAGGATATG CTGAATCGTT TGGTAGCTAA GTTGAAAGAT ATGATGATGG CTGAAGGCAT GATGGGTTCT GGTTCGTTAT CAGTGGCTAT GGACGGTTCT CTTTCTGATA GGAGTGATGT ATGA
Protein sequence | MSNQSKYQVA IAAIDQYSAP LKAASASFDS LTEDVKAQSA EIKKLNASAA SLNSYASIKR DLTETSAQMG SAKVKAEQLA VATKELSLQT KGYESAVSSS QSKLQNLEAQ MRATEHPSKA LRTAIKEARK EVKLNTANLN EHQLQLSKTQ AAYDKSRHEV SQLTKHYDVQ RTKLHGLKSK LNESGIQAHK FGEAQRQIKR DITAANAALD KQKAKLKSVQ AASAKIESNK MARSELGGEA LDLAMKGAVL AVPIKFAVDF ESAFADVKKA VNDASDEELE VMKKRIVIEA PKLGVTQEGL SAIIAEGARN GIAKDELFNF AESAAKMSVA FDMAADEAGA SMMKWRTTMN LSQDQAVNLA NAVNYVGDNM ATTAKDITEV LVRQGAVITN AGLDEVQAAS LSAAVLSGSA STEIAATATK NLLLSLTAGD SASGGQKDAL MTLGFDPADL ARDMQENAPK TVEQVLLAIK DQDADVQTAL MKNLFGSESI GSIAPLLQNL DNFRKAFKLV EKDTNFAGSM QNEFEVQSAT AMRKISAFTA SITGLFTVLG ESMLPVVGDV LDTITPAVTW LTEAAQSAPG ATAALMAIPA ALVAVKGAAL AFKAGKLLLG QGKNYADLGK AKLGIGLDGT AESAQKATSR LSRLNQTLDN LGNNGGRGRN RYQRDGASTS SRRKSRPKRA RLSRRGGKFG RLLDFGSRLT DFLPMGAPAP AFAASSNGSA KGKWGKRAAV LGGGTALSML TSSANAADLA LMGADAASVA SVAGDVAGSL PLKGMMAGIA GTAGKLFKPL NVMLQGAALT SAINNGSAEE IGGTAGDMAG GLGGAALGAT IGTAILPGIG TVVGGALGGL AGGELGEWLG TKVAGLFSSD EEETLTAKAA TLDKEKDGLT QPTSQLQADA VLNGPDGGSS LAMLPEPTSV LPSPDSIAKS LSQTNQDNRK VEINLTIPPS SSNPQQDEDM LNRLVAKLKD MMMAEGMMGS GSLSVAMDGS LSDRSDV