Gene VIBHAR_02221 details


Gene Information

Locus tag: VIBHAR_02221
Symbol:
ID: 5553881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio harveyi ATCC BAA-1116
Kingdom: Bacteria
Replicon accession: NC_009783
Strand:
Start bp: 2219373
End bp: 2220800
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 46%
IMG OID: 640907708
Product: hypothetical protein
Protein accession: YP_001445411
Protein GI: 156974504
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
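
The reported lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the variable names are illustrative only.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 2219373
end_bp = 2220800

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Each codon is 3 nt; the final codon is assumed to be the stop codon,
# which contributes no residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1428 bp
print(protein_length, "aa")   # 475 aa
```

Both values agree with the Gene Length and Protein Length fields above.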


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATT TAAGGGACAA CCTCACTCTT TTACTGCACG GGGCATGGCG ACGCCGTTAC 
CTGCTCGTCA TCCCTATGAT AGTGCTACCT ATTTTGGGCT TTCTTATCAG CAAAGCAGTG
CCGACTAAGT ATGTGGCTCA CACCAGCATG CTGATCCAAG AAACGGCCAA AATGAACCCA
TTCCTTCAAG ACCTTGCCGT TTCAACCATG TTGAAAGACC GCTTGAGTGC TTTGAGTACC
CTACTTAAAA GCCGCCATGT GCTGTATTCC GTTGCAAAAG AGCAAGGCTT GATCGACGAT
ACTATGGACG CTAACGAACA AGAGTTCATC ATAAAAGACC TTGCTAATCG CTTAACGGTT
CAACAACTCG GTAAGGACTT CATTCAAATC CAACTCACGA GCAGCCAATC AGAAGGGATG
GAAGCGGTGT TAAGCTCTGT CAGCAATCAC TTTGTCGAAC AGCTCTTAGC GCCAGAGCGC
TCATCAATAA AGGATTCTAG CCACTTCTTG ACTATTCACA TTGATAAACG CCGTGAAGAA
TTGGACAAAG CGGAACAAGC TTTTGCTGAA TACAAAAACG CTTATTCTCA TGCAACACCA
GCGATGCAAG CACAGAGCTT GACGCGTCTC GCCAGCTTAA AACAAACATT GGCAGAGAAG
GAAGCCGAGT TAGCGGGTGT CAAGCGTAGC CTTGGTAGCT TAGACCAGCA ACTTTCAAAG
ACCAATCCAG TGATTGGTAA GATTGAAGAG CAAATTATCG AGATTCGAAG TGAGCTCACT
CTATTGCGTG CACGATACAC AGAAGCACAC AGTTCAGTGC AAGGTAAACT ACGTGAGTTG
AATCGCCTGG AACAGGAGCG CTCAGTACTA CTCAACTCAA AACAACCGGA AATGAACAGT
GACCAACTTT GGGATATTGC AAGTACAACG ACCATAAGCA CGATTGGTGA TGCTCAACCG
CTGCTTGTAT CTCAACTCCG CCAACTGCAA ATCATGCGCG GCCGTTACGA GTCTTTGGAA
GAAGAGACGA TTAGCCTTAG AAACATGATC CAAGAGCTGG AAAGCGACGC CAATCGCTTT
GGTAGCACAG CAACAGAGAT CAATCGACTT GCTCGTGATG TCGCTGTAAA GCGTGAACTT
TATGATGATC TGGTTGAACG TTACGAGATG GCGCAATTGA CCGGATCTTT AGGTGTTTTT
GAAGAAAACA AACGCGTAAA AGTTATCGAT GAGCCTTACA CGCCAACCTT GCCAGCTAAC
TTACCTGCTA TTATCTTTGT CCTTCTTGGA TTAATTGGGG GGGCAGGTTT AGGCATTGGC
CTTGCCACCA TTGCAGAACT GGCAGATAAC TCTATTCGCT CTCGCAAAGC ATTGGAAAAA
CACCTTGGCG CTCCTGTCAT CACTACCATC CCTAAAATCA TATTCTGA
 
Protein sequence
MSDLRDNLTL LLHGAWRRRY LLVIPMIVLP ILGFLISKAV PTKYVAHTSM LIQETAKMNP 
FLQDLAVSTM LKDRLSALST LLKSRHVLYS VAKEQGLIDD TMDANEQEFI IKDLANRLTV
QQLGKDFIQI QLTSSQSEGM EAVLSSVSNH FVEQLLAPER SSIKDSSHFL TIHIDKRREE
LDKAEQAFAE YKNAYSHATP AMQAQSLTRL ASLKQTLAEK EAELAGVKRS LGSLDQQLSK
TNPVIGKIEE QIIEIRSELT LLRARYTEAH SSVQGKLREL NRLEQERSVL LNSKQPEMNS
DQLWDIASTT TISTIGDAQP LLVSQLRQLQ IMRGRYESLE EETISLRNMI QELESDANRF
GSTATEINRL ARDVAVKREL YDDLVERYEM AQLTGSLGVF EENKRVKVID EPYTPTLPAN
LPAIIFVLLG LIGGAGLGIG LATIAELADN SIRSRKALEK HLGAPVITTI PKIIF
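
The correspondence between the gene sequence and the protein sequence can be spot-checked with a small translation routine. This is a minimal sketch, not the tool that produced the record: `translate_cds` is a hypothetical helper, and the codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-nt block spacing used in the record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene sequence, copied from the record.
first_block = "ATGAGCGATT TAAGGGACAA CCTCACTCTT TTACTGCACG GGGCATGGCG ACGCCGTTAC"
print(translate_cds(first_block))  # MSDLRDNLTLLLHGAWRRRY
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the full protein sequence.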