Gene Elen_1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1080 
Symbol 
ID8415370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1306099 
End bp1307190 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content68% 
IMG OID645024043 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_003181440 
Protein GI257790834 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.463173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000154847 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTAGCG TGTGCGCGGC CGCGCCCCAG CTGGCGGGCG TGGTGCCCTA CGATCCGAAG 
TATTTGCCCG TCACCGCCGT CCTCTCGGCG AACGAGAACC CGCACGACGT CGACGACGAG
ATCCGCCGCG ACATCATGCG CGAGGTGAAG CGCCTGCCCC TCAACCGCTA CCCGGACCCG
CTGGCCAACG ACCTGCGCGA CATGATCGCC GAAGCGAACG GGCTCGACCG CGACCAGGTG
CTGGTGGGCA ACGGGGGCGA CGAGCTCTTG TTCAACCTGG CGCTGGCGTG GGGCGGGCCG
GGGCGCACGT TCCTCAACCT GCCGCCCACG TTCTCGGTGT ACGACGCGAA CGCCCGCCTC
ACGAACACGT CGGTGGTGGA CGTGCCGCGC CGCGCCGACT TCTCCATCGA CGAGGAGGCC
GTGCTGGCGC GCGTCGCCGA GGGCGGTATC GACTACCTGG TGGTGACCAG CCCGAACAAC
CCCACGGGGC AGCTTGCCAG CGAGACGTTC ATCCTCCGGC TGCTCGACGC CACCGATGCG
CTCGTGATGG TGGACGAGGC TTACTTCGAG TTCTCGCGCC AGACGATGCG CCCGTATCTG
GCGCAGCACA AGAACCTCGT CATCCTGCGC ACGTTCTCGA AGGCGTTCAG CCTGGCCGGG
GCGCGTATGG GCTACATCCT GGGAGACGCC GAGGTCGTGC GCGAGTTCGT CAAGGTGCGC
CAGCCGTATT CGGTGGACGC CGTCTCGCAG GCCGTTGCGC GCGTGGTGTA CGCGAACCGC
GCGAAGTTCG AGCGCGGCAT CCTGGCCGTC ATCGAGGAGC GCGCCCGCCT GATCGAGGGA
CTGAAGAGGA TTCCCGGCGT GAAGCCCTTC CCGTCGGATG CGAACTACGT GCTGTTCCGC
GTGGAGAACG CGCCCGTCAT CTGGGAGGCG CTGTACGAGC GCGGTGTGCT TGTGCGCGAT
TTCTCGCGTG CGGCGCATCT GGAGAACTGC CTGCGCGTGA CCGTGGGCGC CTCCGAGGAG
AACGACGCGT TTTTGCGCGC GCTGCGCGAT GCGGTGATGG GCAAGTGCGA TTTGAAGGTT
CCGTCGACCT GA
 
Protein sequence
MRSVCAAAPQ LAGVVPYDPK YLPVTAVLSA NENPHDVDDE IRRDIMREVK RLPLNRYPDP 
LANDLRDMIA EANGLDRDQV LVGNGGDELL FNLALAWGGP GRTFLNLPPT FSVYDANARL
TNTSVVDVPR RADFSIDEEA VLARVAEGGI DYLVVTSPNN PTGQLASETF ILRLLDATDA
LVMVDEAYFE FSRQTMRPYL AQHKNLVILR TFSKAFSLAG ARMGYILGDA EVVREFVKVR
QPYSVDAVSQ AVARVVYANR AKFERGILAV IEERARLIEG LKRIPGVKPF PSDANYVLFR
VENAPVIWEA LYERGVLVRD FSRAAHLENC LRVTVGASEE NDAFLRALRD AVMGKCDLKV
PST