Gene EcolC_1621 details


Gene Information

Locus tag: EcolC_1621
Symbol:
ID: 6066776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1801906
End bp: 1802976
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 641601036
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001724606
Protein GI: 170019652
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
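
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1801906
END_BP = 1802976
GENE_LENGTH_BP = 1071
PROTEIN_LENGTH_AA = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1802976 - 1801906 + 1 gives 1071 bp, and 1071 / 3 - 1 gives 356 aa, matching the listed gene and protein lengths.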


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.796299
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCACCG TGACTATTAC CGATTTAGCG CGCGAAAACG TCCGCAACCT GACGCCGTAT 
CAGTCGGCGC GTCGTCTGGG CGGTAACGGC GACGTCTGGC TGAACGCCAA CGAATACCCC
ACAGCCGTGG AGTTTCAGCT TACTCAGCAA ACGCTCAACC GCTACCCGGA ATGTCAGCCG
AAAGCGATGA TTGAAAATTA CGCGCAATAT GCAGGCGTAA AACCGGAACA GGTGCTGGTC
AGCCGTGGCG CGGACGAAGG TATTGAACTA CTGATTCGCG CTTTTTGCGA ACCGGGTAAA
GACGCCATCC TCTACTGCCC GCCAACGTAC GGCATGTACA GCGTCAGCGC CGAAACGATT
GGCGTCGAGT GCCGCACAGT GCCGACGCTG GACAACTGGC AACTGGACTT ACAGGGCATT
TCCGACAAGC TGGACGGCGT AAAAGTGGTC TATGTTTGCA GCCCCAACAA CCCCACCGGA
CAACTGATCA ATCCGCAGGA TTTTCGCACC CTGCTGGAGT TAACGCGCGG TAAGGCGATT
GTGGTTGCCG ATGAAGCCTA TATCGAGTTT TGCCCACAGG CATCGCTGGC TGGCTGGCTG
GCGGAATATC CGCACCTGGC TATTTTACGC ACACTGTCGA AAGCTTTTGC TCTGGCGGGC
CTTCGTTGCG GATTTACGCT GGCAAACGAA GAAGTCATCA ACCTGCTGAT GAAAGTGATT
GCCCCCTACC CGCTCTCGAC GCCGGTTGCC GACATTGCGG CCCAGGCGTT AAGCCCGCAG
GGAATCGTCG CTATGCGCGA ACGAGTGACG CAAATTATTG CAGAACGCGA ATACCTGATT
GCCGCACTGA AAGAGATCCC CTGCGTAGAG CAGGTTTTCG ACTCCGAAAC CAACTACATT
CTGGCGCGCT TTAAAGCCTC CAGCGCAGTG TTTAAATCTT TGTGGGATCA GGGCATTATC
TTACGTGATC AGAATAAACA ACCCTCTTTA AGCGGCTGCC TGCGAATTAC CGTCGGAACC
CGTGAAGAAA GCCAGCGCGT CATTGACGCC TTACGTGCGG AGCAAGTTTG A
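
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one possible way to clean such a block-formatted sequence, estimate its GC content, and run a basic sanity check on the reading frame. The helper names are hypothetical, and the check assumes an ATG start codon and the stop codons TAA, TAG, and TGA (translation table 11 also permits alternative start codons, which this simple check does not cover).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is divisible by 3, the sequence starts with ATG,
    and it ends with TAA, TAG, or TGA."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Illustration on the first two blocks of the gene sequence only;
# paste the full sequence text to analyze the whole gene.
fragment = clean_sequence("ATGAGCACCG TGACTATTAC")
print(len(fragment), gc_percent(fragment))
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field of the record, which is listed as a rounded percentage.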
 
Protein sequence
MSTVTITDLA RENVRNLTPY QSARRLGGNG DVWLNANEYP TAVEFQLTQQ TLNRYPECQP 
KAMIENYAQY AGVKPEQVLV SRGADEGIEL LIRAFCEPGK DAILYCPPTY GMYSVSAETI
GVECRTVPTL DNWQLDLQGI SDKLDGVKVV YVCSPNNPTG QLINPQDFRT LLELTRGKAI
VVADEAYIEF CPQASLAGWL AEYPHLAILR TLSKAFALAG LRCGFTLANE EVINLLMKVI
APYPLSTPVA DIAAQALSPQ GIVAMRERVT QIIAEREYLI AALKEIPCVE QVFDSETNYI
LARFKASSAV FKSLWDQGII LRDQNKQPSL SGCLRITVGT REESQRVIDA LRAEQV