Gene VIBHAR_02074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVIBHAR_02074 
Symbol 
ID5554421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio harveyi ATCC BAA-1116 
KingdomBacteria 
Replicon accessionNC_009783 
Strand
Start bp2072888 
End bp2073925 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content49% 
IMG OID640907561 
Productformimidoylglutamase 
Protein accessionYP_001445266 
Protein GI156974359 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCTCA CGGAGGACTT TAACATGGCT CAATCAGACC CAAATACCGC TCCATTTCAC 
TGGCAAGGTC GCCACGATGC AGAAGATGGA GAACTGGGTA AGCGCGTTCA CCACGTGATT
AAAAATATTT CGGTTGAAGA GTTACCGAGC AAAAGCGAAG GCGTCTCAAT ACTTGGTTTT
GCGACCGACG CGGGTGTTGC TAGAAATAAA GGCCGTATTG GCGCAAAAAA GGCACCGGAT
TTAATCCGTC GTGCCCTAGC CAATCTCGCT TGGCACCAAG ATGCACCGCT TTATGACCTC
GGTACTGTCG TTTGCGAAGA CGACCTACTA GAGAGCAGCC AATCGCAATG CGCAGCGACG
GTTGCTCAAG CTCTACCCCA TTCACCTGTC GTAGTATTAG GCGGCGGGCA TGAGATCGCA
TGGGCGTCAT TTTCAGGGTT AGCCGAGTAC TTCAAAACCC ATCACCCAGA AAAGCAGCCG
AAGATTGGCA TTATCAACTT CGACGCACAC TTCGACCTAC GCGCTTTTGA AAGTTCGCTG
GCAGATGTAA AACCGAGCTC AGGCACACCA TTTAATCAGA TTCATCACTT CTGCCAACGC
AATGATTGGA AGTTTCATTA CGCTTGCATT GGCGTCAGTC GCAGCAGCAA TACCAAAGCG
CTATTCCAGA AAGCGGACGA ACTCAATGTT TGGTATATGG AAGACAAACA GCTTTGCTAC
ATGAATCACA GCTACCATTT AACGCAGCTA CAGCACTTTA TCGATCACTG CGATTACCTC
TATCTGACGA TTGATTTGGA CGTGTTCCCT GCGGCCACAG CTCCGGGTGT AAGCGCTCCA
GCACCAAGAG GGGTCAGCTA CGACATCATT TCACCGTTTC TCGACCGAAT CCTACATTAC
AAAAACAAGC TCATGCTGGC AGACATTGCC GAGTATAACC CTACCTATGA CGTCGATAGC
CAAACCGCTC GATTGGCAGC CCGCCTATGT TGGGACATCG CCAATGCCAT GGCAGAGAAA
GACCATAAAC CAAAATAA
 
Protein sequence
MSLTEDFNMA QSDPNTAPFH WQGRHDAEDG ELGKRVHHVI KNISVEELPS KSEGVSILGF 
ATDAGVARNK GRIGAKKAPD LIRRALANLA WHQDAPLYDL GTVVCEDDLL ESSQSQCAAT
VAQALPHSPV VVLGGGHEIA WASFSGLAEY FKTHHPEKQP KIGIINFDAH FDLRAFESSL
ADVKPSSGTP FNQIHHFCQR NDWKFHYACI GVSRSSNTKA LFQKADELNV WYMEDKQLCY
MNHSYHLTQL QHFIDHCDYL YLTIDLDVFP AATAPGVSAP APRGVSYDII SPFLDRILHY
KNKLMLADIA EYNPTYDVDS QTARLAARLC WDIANAMAEK DHKPK