Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VIBHAR_02101 |
Symbol | |
ID | 5554177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio harveyi ATCC BAA-1116 |
Kingdom | Bacteria |
Replicon accession | NC_009783 |
Strand | + |
Start bp | 2099848 |
End bp | 2102724 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640907588 |
Product | hypothetical protein |
Protein accession | YP_001445293 |
Protein GI | 156974386 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATT TAACACCAAT GCAGGCTGCA TGTTGGTTTG GAAGAAATGC GAACGCTAAA CTTGGTGGTG TTGCGTCCCA CCTTTATACC GAATTCGATG GCAAGAATAT TAACCTTGAA AAACTTCACG CTGCTTTAAT AAAATTATAT AAACGGCATG AAATGTTGCG ATTGAAGGTT GATAGCTCAG GCACCTGTTC TATTATCGAT GAGCCTAATG GAAACATTTT AGAGGTAGAT GATTTTAGTC AGTTATCGAC TGATAAATTA ACTGATTCCT TGAGTGAAAA AAGAAACCAA TGGGCACATC AAAAGCTTGA TTTAACACAA GGACAAGCCG CAAAGTTCTC AGTGAGTTTA TTACCTGATG ATGAGTTTCG CTTTCATATT GATACCGATA TGATTGCTAT CGATCCAGAT AGCTGTCGTA TTTTAATCGA GGACTTAGCG ACGCTTTATG AAGGCGGTGT ATTTGATACG GATGATAAGC CGTCCTTTTT TGCGTGGCAT AATCTCGCAA AAGAAGAGCC AGAACTCAAG CTACAGCGAA AAGTCGACCG AGCGTGGTGG AAAGCTCATT TAGAAAGTAT TGCGCCAGCA CCCTCCTTGC CATTTCCTGA AGGCAACCCA AGTCAAATTA CAAGTGAGCA CTACACCGCT TGGCTTGATT CTGAACAGCG ATCCACTCTG TTTTCTCTCG CAAAGCAATA CAAAATCACG CCCGCTAATC TGATGCTAGG TTTTTTCGCA ACAACCTTAG GCACGGCGAC AGGCGATGAA GCTTTTAGGA TTAACGTTCC AGTATTTTGG CGGCCACCAA TAACGCAGGG TACAGAAAGG GTTGTCGGCG ATTTCGTCAA CTTTGTTGTG CTCAGTGTTG ATATGACGCA GGCCGACACG CTGATCGATT TTTGCCACTC TGTCGCCGAA AAAATGGGGC CGTTACTTGG TCATAGCTTC TATGACGGTG TGAATGTGAT GCGAGATCTT TCGTTGCATC ATGGTAGTGC CCAGTTAGCG CCGGTTATTT TTACTTCTGC TCTTGATTTA CCGAGTGGTG ATCTTTTCTC CTGTCGTGTC CATAAACATT TTGGCAAGAT GAATTGGACT ATCTCTCAAG GCTCTCAGGT TGCACTGGAT TCACAAGTAG TGAGCATTGA TGGTGGTATC ATGATTAATT GGGATGTCCG ACAAGAAGCC CTACCAAAGG AATGGACATC TGCGATGTTT GATCACTTTG TGGCTCTCAC CAAAAGCGTC ATTTCCAACC CTAAATTATT GACCGCTTCG ATGGGCAGTT TGCACTCAAA GCTCGTATGC CAGTCTGACT TAACTGCGGA GCTGAGCTCG ATGCAGCGCG CATATTTATT AGGTCGAACG ACGCAGATGC CATTAGGCGG AGTTGCGATG CAGGAGCTCC TTGAATATCG AGGTACGCTT TCTCCAGTTG TTATCCGTCG CAGGCTATCA GAAATGGTCA TCCGATACCC AAGCTTGAGG AAATATATTG ATAGTAAATC ACTCGAGCTC AAAGTTAGCC TATGCCCACA AGTGAACCTG ACTTTGCATG ATTTATCTTG CACAGAAAAA GACAGGGTAG AAACTGAGTT AGCACGCTTC AGGCATGATT ACAGCCACGC CATGTTTGAT TTAGAGCGAC CGCTATGGGA TCTCACAATG TTTGCGCTTC CGGATGGTAT CACTCATGTG TTCGCGCGCT TTGATGCGTT AATCCTCGAT GCTCGCTCTA TTGCTGCGCT GTTGGTGGAG TTGTTTGAAG GTGAAGCGCC TTACTTACCG CATATTGAAC CCGAGTTATC GACAGAAGAC CCGAAGATTA CTCGCCATAG AGACGAAAAC TATTGGCTGG AAAAATTAAA ATCTGTTGAT AAACCGATGC AATTCCCTTG GCATAAATCA CTCGATTTGA TTCCTTGTTC TCGTTACCAA AGACAGAGTC TTGAGATTGA TAAAGAGACG GTGAAAAAGC TCGTCCGGGT GGGAGGCAAG CAGGGGCTTT ACAAGAATAC CATGATGATG TCAGCAGCTC TTGAGGCGAT ATCTTCGCAT GTGAATGGTA GAAAGTTGTG TGTTGCGGTT CCGGTGTTGC CAGTGGTGAG CGCGACATAC TCAAGTGAAT CAAGTTTTAT TGCTGTTCAA TGGCAAGCGA CACAAAGCGA TTATCTTCAG CGAGCCAAAA GCTTGCAAGC AGACACGTTA GAAGGGCTCG AACATCTCGC CTTTTCTGGG GTTGATCTGG CTCGTACCTT ATTTGAACGA TGTGGACCAG GGCCAACTTT GCCGATTGTG ATAACAAACG GTTTATCGTG GCCAACGCTC AGTAAGAATG CTCCGATGCA ACTTCAGCGT GGGCTGACAC AAACTCCTCA AGTTGCAATG GATGTGCGTT TTGTTGCGAT GGCTGGTGGC TCGATTATGT TCAGCGTTGA TTATGCCAGC GAAGCCGTGG CGGACGAACA GGTAAAAATG ATATTGGATC ATATCGATAT GACTTTCTGC TATATGGCTG AGACTTCTCT GTTTGAAACT GCCCCTGAGC AAACTCTCAA GCCTTTGGAG TCCAGAGTGA TAGAGCTCTT CGATAGACCG CTCAGCGATG TGGCTGCTGA AGACTCAACC GAACAGATTA TCTTTGATGT CTATTGTCAG GTGCTTGGTA AACCAGTGAC TGATGAGACC AAAGAGAGCC TGCCTTTTTC CCAGTTGGGT TTGAGGCCTA ATCATCTAAA ACAAATCAGT ACTGAATTAA ATAAAGTGCT ACGGGTGGAG TTGCCACCGA TGCAGCTTAT TCGCTGTAAA GATGCTGCAG AGGTAAAAGC GCTCGCTTTA ACTCAGGGTT TCGAGGTCGA AACCTAA
|
Protein sequence | MSDLTPMQAA CWFGRNANAK LGGVASHLYT EFDGKNINLE KLHAALIKLY KRHEMLRLKV DSSGTCSIID EPNGNILEVD DFSQLSTDKL TDSLSEKRNQ WAHQKLDLTQ GQAAKFSVSL LPDDEFRFHI DTDMIAIDPD SCRILIEDLA TLYEGGVFDT DDKPSFFAWH NLAKEEPELK LQRKVDRAWW KAHLESIAPA PSLPFPEGNP SQITSEHYTA WLDSEQRSTL FSLAKQYKIT PANLMLGFFA TTLGTATGDE AFRINVPVFW RPPITQGTER VVGDFVNFVV LSVDMTQADT LIDFCHSVAE KMGPLLGHSF YDGVNVMRDL SLHHGSAQLA PVIFTSALDL PSGDLFSCRV HKHFGKMNWT ISQGSQVALD SQVVSIDGGI MINWDVRQEA LPKEWTSAMF DHFVALTKSV ISNPKLLTAS MGSLHSKLVC QSDLTAELSS MQRAYLLGRT TQMPLGGVAM QELLEYRGTL SPVVIRRRLS EMVIRYPSLR KYIDSKSLEL KVSLCPQVNL TLHDLSCTEK DRVETELARF RHDYSHAMFD LERPLWDLTM FALPDGITHV FARFDALILD ARSIAALLVE LFEGEAPYLP HIEPELSTED PKITRHRDEN YWLEKLKSVD KPMQFPWHKS LDLIPCSRYQ RQSLEIDKET VKKLVRVGGK QGLYKNTMMM SAALEAISSH VNGRKLCVAV PVLPVVSATY SSESSFIAVQ WQATQSDYLQ RAKSLQADTL EGLEHLAFSG VDLARTLFER CGPGPTLPIV ITNGLSWPTL SKNAPMQLQR GLTQTPQVAM DVRFVAMAGG SIMFSVDYAS EAVADEQVKM ILDHIDMTFC YMAETSLFET APEQTLKPLE SRVIELFDRP LSDVAAEDST EQIIFDVYCQ VLGKPVTDET KESLPFSQLG LRPNHLKQIS TELNKVLRVE LPPMQLIRCK DAAEVKALAL TQGFEVET
|
| |