Gene VIBHAR_00214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_00214 
Symbol 
ID5554327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp213680 
End bp214813 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content49% 
IMG OID640905710 
Producthemolysin 
Protein accessionYP_001443481 
Protein GI156972574 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGT TATCCATTTA TATATCTATT GCTATCGGCA TTTCGTTTAT CTGTTCGGTA 
CTGGAGGCGG TACTTCTTAG TATCAGTCCG AGCTACATTG CGCAGCTCAA ACAACAGGGG
CACCCTGCTT CTGCTCAGTT AGAAAAACTG AAAGCAGACA TCGACCGACC ACTGGCTTCG
ATCCTGACGC TGAACACCAT TGCTCACACG ATCGGTGCTG CAACGGCAGG TGCACAGGCG
GCGGTAGTGT TCGGTAGCGA AGCGCTTGGT ATCTTCTCTG CGTTACTGAC ACTGGCGATT
CTTGTTCTAT CAGAAATCGT ACCTAAGACG ATTGGTGCAA CTTACTGGCG CCAACTTGCA
CCATCTGCGG CGGTATCGCT ACGCTGGATG GTATGGGCAC TAACACCATT CGTATGGTTC
TCTGAGCAAA TCACTAAGCG TCTAGCTCGT AACCACGAAG CACCAAAAAT GCGTGATGAG
CTGTCTGCTA TGGCTATTCT GGCAAAAGAA AGCGGTGAGT TCGCAGAAGG AGAATCAAAG
ATCCTAAGCA ACCTACTTGG TATTCAAGAT GTGCCAGTAA CACAAGTTAT GACTCCGCGC
CCAGTAGTAT TTCGCGTTGA CGCGACCATG ACCATCAATG ACTTTCTAGA GAAGCATAAA
GAGACGCCTT TCTCTCGCCC ACTGGTTTAC AGCGAGCAGA AAGACAACAT CATTGGTTTT
GTGCACCGTC TAGAGCTGTT CAAGCTGCAA CAATCAGGCA GCGGCCAAAA ACAACTTGGC
TCGGTCATGC GCCCTATCCA AGTGGTACTG AACAACACCG CTCTGCCTAA AGTGTTCGAC
CAGATGATGA CTCACCGCCT GCAGCTAGCG CTTGTGGTTG ACGAATACGG TACAGTACAA
GGCTTGGTAA CGCTAGAAGA CATCTTTGAG CACCTATTGG GTGAAGAGAT CATCGATGAA
GCGGATAAGA GCACTGACAT GCAAGAACTG GCTTACCAAC GCTGGGAAAG CTGGAAAGAA
AAGCACGGCG TAATTGAAAG CCGCGATGAC GATGAAGAAG AGGAAGAACT GGAGAAACAG
GACAAAGAGC CAGAAGCTCA AGAGCCAACC AAACCAGAAT CTAAAGAATC GTAA
 
Protein sequence
MLLLSIYISI AIGISFICSV LEAVLLSISP SYIAQLKQQG HPASAQLEKL KADIDRPLAS 
ILTLNTIAHT IGAATAGAQA AVVFGSEALG IFSALLTLAI LVLSEIVPKT IGATYWRQLA
PSAAVSLRWM VWALTPFVWF SEQITKRLAR NHEAPKMRDE LSAMAILAKE SGEFAEGESK
ILSNLLGIQD VPVTQVMTPR PVVFRVDATM TINDFLEKHK ETPFSRPLVY SEQKDNIIGF
VHRLELFKLQ QSGSGQKQLG SVMRPIQVVL NNTALPKVFD QMMTHRLQLA LVVDEYGTVQ
GLVTLEDIFE HLLGEEIIDE ADKSTDMQEL AYQRWESWKE KHGVIESRDD DEEEEELEKQ
DKEPEAQEPT KPESKES