Gene Msed_1942 details

Gene Information

Locus tag: Msed_1942
Symbol:
ID: 5103329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1886825
End bp: 1888000
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 48%
IMG OID: 640507830
Product: histidinol phosphate aminotransferase
Protein accession: YP_001192006
Protein GI: 146304690
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
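
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1886825
end_bp = 1888000
gene_length_bp = 1176
protein_length_aa = 391


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 1888000 - 1886825 + 1 gives 1176 bp, and 1176 / 3 - 1 gives 391 aa, which agrees with the listed values.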


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.95238
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGACAAG AAGGTGCTTT CCTGAAGCTA ACTCCTCGAG AAACATGTGA TGGAAAAAGC 
TTAAAACAGG TTAAAAACTA TTCCTCCCGA GGGATTCTTA TTGCACCAAC CGATGTAAAG
GACATTATAT ACCCATGGTT AAAGGAGGCA AAGGAGTACG ATTTCTCGGA TATCAAGGAT
GGGATCAGGC TACACCTGAA TGAGTCCCCC TATCCGCCTC CCGATTTCGT CCTAGATGCG
GTAAAGAAGT ATTTAATTCA AGGGAACAGG TATCAGCATC CAGATCTAAC CAAGAGGTTT
AAGGAGCTCG CTGCCGAGTA TAATAAGGTT GAACCTGGGG AGATCTTTCC AACACCTGGT
GGAGACGGAG CCATAAGGTC AGTGTTCCTT AATCTCTCCA TGCCTGGGGA CAGGGTCGTG
TTGAACTATC CCAGCTACAG CATGTACTCG GTCTACTCTT CCTTTAGGGG GTTGAACCAG
GTTAGGGTTC CCCTTAGGGA GGAGGGAGAG TGGTGGAAGG AGGACTGGGA GAAGTTAGTT
ACTGAGGCCA GAGACGCTAG ATTGGTGGCA ATAGACGACC CCAATAATCC AACTGGCTCC
CCCATGATCA TGGGGGACGA GCAAAGATTG AGGGAACTGG TGGAGTCGAC CAAGGGGATA
GTCCTTCTTG ACGAGGCATA TTACGAGTTT TCGGGATACA CTGCCTCAAG GCTGGTGTCT
AAGTATCCGA ACCTCATGAT TGTGAGGACC ATGAGTAAGG CCTTCTCTCT TGCCTCCTTT
AGGGTGGGTT ATCTCATAGC TAACAGGGAT GTGGTAAAGG CCCTCGAGAA GGGATCCACG
CCCTTTGACG TTGCTCTTCC TTCTCTCATA GCTGGCATAA CCGCATTGGA AAATCCAGGT
TACGCACACA GGATTGCTCA GGAGATCTCG GAGAACAGGG AAGGATTATA CCAGGGATTA
ATTTCCCTCG GCGTGAAGGC TTACAGGTCA ATTACCAACT TCCTCTTGTT TAAACATTCA
GCGGAGCTGG TCGAGCCCTT GATGAGGAAA GGGATAGCCA TAAGGAACCC AGTAAAGGGA
TTTTATAGGG TATCAGTTGG GACAAAAGAG CAGTGTAATT TGTTCCTAAA TAAACTGGGT
GAAGTACTTG AAAATAGCGA TACCAAACAA AGGTAG
 
Protein sequence
MRQEGAFLKL TPRETCDGKS LKQVKNYSSR GILIAPTDVK DIIYPWLKEA KEYDFSDIKD 
GIRLHLNESP YPPPDFVLDA VKKYLIQGNR YQHPDLTKRF KELAAEYNKV EPGEIFPTPG
GDGAIRSVFL NLSMPGDRVV LNYPSYSMYS VYSSFRGLNQ VRVPLREEGE WWKEDWEKLV
TEARDARLVA IDDPNNPTGS PMIMGDEQRL RELVESTKGI VLLDEAYYEF SGYTASRLVS
KYPNLMIVRT MSKAFSLASF RVGYLIANRD VVKALEKGST PFDVALPSLI AGITALENPG
YAHRIAQEIS ENREGLYQGL ISLGVKAYRS ITNFLLFKHS AELVEPLMRK GIAIRNPVKG
FYRVSVGTKE QCNLFLNKLG EVLENSDTKQ R
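
The gene and protein sequences above can be compared by translating the DNA codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the listed gene sequence is already in coding orientation, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first and last 10-base groups rows of the gene sequence are pasted in; the full sequence can be substituted the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence shown in this record.
first_row = "ATGAGACAAG AAGGTGCTTT CCTGAAGCTA ACTCCTCGAG AAACATGTGA TGGAAAAAGC"
last_row = "GAAGTACTTG AAAATAGCGA TACCAAACAA AGGTAG"

print(translate(first_row))  # MRQEGAFLKLTPRETCDGKS
print(translate(last_row))   # EVLENSDTKQR
```

The two printed strings match the start and end of the protein sequence listed above. Because every row before the last holds 60 bases (a multiple of 3), the last row stays in frame when translated on its own. `gc_fraction` applied to the full gene sequence gives a value that can be compared with the GC content field.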