Gene VIBHAR_01787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_01787 
Symbol 
ID5556131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp1799516 
End bp1800916 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content47% 
IMG OID640907278 
Producthypothetical protein 
Protein accessionYP_001444983 
Protein GI156974076 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAA ACGCGATCCT TCTATCTTTA CTCGGTACGG CTTCCTTCTC TTCTCAAGCG 
GCGGCGGACA TCACCGCAAT CCGCTCTAGC CTATTACATT TCATCAACGA TCCAGCCAAA
GTTGAACACC TTCCCGATGC CTATCAGTAT TTTGAAGACG GTTTATTAGT GATTGAAGAT
GGTCATATCA AAGCCATCAA AGCGTTCAGT GAAAGCGATT CGACACAGTA CCAAGATATT
GTCGATCACA GGGGAAAACT CATCGTGCCG GGCTTTATCG ACACGCACAT CCACTATCCA
CAAACTCAGA TGATTGCGGC GTATGGCGAG CAACTGCTGG AATGGTTAGA AACCTACACC
TTCCCAACGG AAAAACAGTT TGAGGACAAA GCCCACGCCC AAGCTATCTC TCAGTTCTTT
ATTAATGAAC TGCTGAAAAA TGGCACCACC AGCGCATTAG TGTTTGGCAC TGTTCATCCA
CAATCGGTGG AAGCGCTTTT TGAGGAAGCG TTAGATAAGA ACATGCGTAT CATTGCGGGG
AAGGTCATGA TGGATCGCAA CGCGCCAGAC TACCTACTCG ACACACCGGA AACGGGCTAT
AAAGAAAGCA AAAGCCTGAT CAACAAATGG CACAACCAAG GACGATTGCA ATACGCCATT
ACTCCACGCT TTGCCCCTAC CTCGACGCCC AAACAATTAG CGGCTGCTGG GAAACTCAAA
GCCGAGTATC CAGATGTCTA TGTGCATACG CACTTGTCAG AAAACAAAAA TGAAATCGAA
TGGGTGAAGT CGCTCTTCCC CGACCGTGAA GGCTACTTCG ATGTTTACGA ACATTACGGC
TTAGCTGGCA AGCGTTCTAT CTTCGCCCAT GCCGTTCACC TGACCGACAA AGAATGGAGT
GCATTTCAGC GTACCGATTC CGTCATTTCC TTTTGCCCTA CCTCCAACCT ATTCCTTGGT
AGTGGCCTAT TTGATTTAGA AAAAGCGGAA CAGAAAGGTG TACGCGTTGG CTTGGGCACC
GATGTAGGCG CAGGCACCAG TTTTTCTCAG TTAGAATCCT TAAATGAGGC CTATAAAATC
ATGCAGTTAC AAGGAAAGAA ACTCTCTGCG TTTAAAGGCT TGTACTTAGC GACATTGGGC
GGCGCGACCT CGCTTAGCCT TGATGACAAG ATAGGCAATT TTGAAGCAGG AAAAGAAGCC
GACTTCGTTG TCCTCAACTG GGCAGCAACG GATTTGCAGA AGTTGCGTTA CCAACATTCC
AAAAGCTTGG AAGATAAGCT CTTTGCATTA ATGATGTTAG GCGATGAGAG AAACGTCGAA
GCAACCTACA TAGCTGGCAA ACTGGCCTAT TCTGCCGATG GCGAAGTCAC TAGAACGACA
AAAATCGAGA GCTCAATCTA A
 
Protein sequence
MNKNAILLSL LGTASFSSQA AADITAIRSS LLHFINDPAK VEHLPDAYQY FEDGLLVIED 
GHIKAIKAFS ESDSTQYQDI VDHRGKLIVP GFIDTHIHYP QTQMIAAYGE QLLEWLETYT
FPTEKQFEDK AHAQAISQFF INELLKNGTT SALVFGTVHP QSVEALFEEA LDKNMRIIAG
KVMMDRNAPD YLLDTPETGY KESKSLINKW HNQGRLQYAI TPRFAPTSTP KQLAAAGKLK
AEYPDVYVHT HLSENKNEIE WVKSLFPDRE GYFDVYEHYG LAGKRSIFAH AVHLTDKEWS
AFQRTDSVIS FCPTSNLFLG SGLFDLEKAE QKGVRVGLGT DVGAGTSFSQ LESLNEAYKI
MQLQGKKLSA FKGLYLATLG GATSLSLDDK IGNFEAGKEA DFVVLNWAAT DLQKLRYQHS
KSLEDKLFAL MMLGDERNVE ATYIAGKLAY SADGEVTRTT KIESSI