Gene Veis_4374 details

Gene Information

Locus tag: Veis_4374
Symbol:
ID: 4693292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4825060
End bp: 4826250
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 68%
IMG OID: 639852123
Product: histidinol-phosphate aminotransferase
Protein accession: YP_999095
Protein GI: 121611288
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
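
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4825060
end_bp = 4826250
gene_length_bp = 1191
protein_length_aa = 396


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1191
print(expected_protein_length(gene_length_bp))  # 396
```

Under these assumptions both derived values agree with the reported gene length (1191 bp) and protein length (396 aa).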


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.279737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.478615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTTT GCAGCCCGCC ACGCTTGGCT GCTTGCCGCA CCCCGATTCA GCCCGCGCAC 
TGCAGACTCC CCCGAATGCC CGCTCCATCC CCGACCCCCC TGACCCATAT CCGCCCCGAT
GTGCGCGCCA TGCGCGCATA CCATGTGCAG CCGGCCACCG GCATGCTCAA GATGGACGCG
ATGGAAAACC CGTTCCGGCT GCCGGCCGAT CTGCAAACCG CGCTCGGCCA GCGCCTGGGC
GCTCTGGCGC TCAACCGCTA CCCGAGCGAC GCGCGCCTGG CCGAGCTGCA GGCCGCGCTG
GCGCGCTACG CCGGCCTGCC CGAAGGCCAT CGCATCATGC TCGGCAATGG CTCGGACGAA
CTCATCGCGC TGCTGGCCCT GGCCTGCGCC CGGCCCGGCA GCGGCGAGCG CCCCGGCGTG
CTGGCTCCGC TGCCCGGCTT TGTGATGTAT GCGTTGAGCG CGCAATTGCA GGGCCTGGAC
TTCGTCGGCG TGCCGCTGAC GGCCGATTTC GAGCTGGACG AGCCGGCGAT GCTGGCCGCC
ATCGCCCGGC ACCGGCCCGC GCTCACCTAC ATCGCCTACC CCAACAACCC CACGGCCACG
CTGTGGGACG AAGGCGCGGT GCAGCGCATC ATCGACGCGG TCGGCACGCA GGGCGGCATC
GTGGTGATGG ATGAAGCCTA TCAGCCCTTT GCCTGCCGTA GCTGGATCGG GCGCCTGCAC
GCCGAACCCG GGCGCAATGC CCATGTGCTG CTGATGCGCA CGCTCAGCAA GTTCGGCCTG
GCCGGTGTGC GCCTGGGCTA CCTGATCGGC CCGGCGGCCC TGGTCAACGA GATCGACAAG
GTGCGCCCGC CCTACAACGT GAACCTGCTC AGTTGCGAAA CCGCGCTGTT TGCGCTCGAA
CATGCCCCGG TGTTCGCCGC CCAGGCGGCC GAACTGCGCA CCCAGCGCGA CCTGCTGATC
GGTGCGCTGC GCCAGTTGCC CGGCATCGCA AAATGCTGGG ACAGCCAGGC CAACATGGTG
CTGGTGCGGG TGGCCGATGC CAGCCGCACC TACGAGGGCA TGAAAACCCT GAAGGTCTTG
GTCCGGAACG TTTCTACAAT GCACCCCTTG CTGAGCAACT GCCTGCGCCT GACGGTCGGC
AGTGCCGACG ACAACGCACA AATGCTGGCT GCACTCCAGG CCTCTTCATG A
 
Protein sequence
MAFCSPPRLA ACRTPIQPAH CRLPRMPAPS PTPLTHIRPD VRAMRAYHVQ PATGMLKMDA 
MENPFRLPAD LQTALGQRLG ALALNRYPSD ARLAELQAAL ARYAGLPEGH RIMLGNGSDE
LIALLALACA RPGSGERPGV LAPLPGFVMY ALSAQLQGLD FVGVPLTADF ELDEPAMLAA
IARHRPALTY IAYPNNPTAT LWDEGAVQRI IDAVGTQGGI VVMDEAYQPF ACRSWIGRLH
AEPGRNAHVL LMRTLSKFGL AGVRLGYLIG PAALVNEIDK VRPPYNVNLL SCETALFALE
HAPVFAAQAA ELRTQRDLLI GALRQLPGIA KCWDSQANMV LVRVADASRT YEGMKTLKVL
VRNVSTMHPL LSNCLRLTVG SADDNAQMLA ALQASS
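
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the tool that generated the record. It assumes the sense strand is shown as written, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs only in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCTTTTT GCAGCCCGCC ACGCTTGGCT GCTTGCCGCA CCCCGATTCA GCCCGCGCAC"
print(translate(first_line))  # MAFCSPPRLAACRTPIQPAH
print(gc_fraction(first_line))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow a comparison with the reported protein sequence and GC content.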