Gene Arth_1470 details

Gene Information

Locus tag: Arth_1470
Symbol:
ID: 4446001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1631267
End bp: 1632385
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 65%
IMG OID: 639689281
Product: histidinol-phosphate aminotransferase
Protein accession: YP_830964
Protein GI: 116670031
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
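
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function and constant names are illustrative and are not part of the record.

```python
# Values copied from the gene record above.
START_BP = 1631267
END_BP = 1632385
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 1631267 to 1632385 covers 1119 bp, which corresponds to 373 codons: 372 residues plus the stop codon.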


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0168842
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTGACC AGCTAGAGCG CCTGAACCGA CTTCCCCTCC GGACCAACCT GCGTGGCCTC 
ACCCCGTATG GTGCGCCGCA GCTGGACGTT CCCATCCTGC TCAATGTCAA CGAAAACACC
CATGGCGTTC CGGCCGATGT CCTCGTGGCC ATTTCCGAGG CCGTGACCGC GGCTGCTGCG
GGGCTGAACC GCTATCCGGA CCGTGAGTTC ACCGAACTCC GGGAGCGACT GGCCGAGTAC
CTTGGCCACG GCCTGGGTGC GGAGAACATC TGGGCCGCCA ACGGATCCAA CGAAGTGCTG
CAGCAGATTC TGCAGGCATT CGGCGGTCCG GGACGTACCG CACTTGGTTT CCCGCCCACG
TACTCCATGT ATCCCCTCCT GGCGAGCGGG ACCGACACCG AATACATCGT CGGCCAGCGT
GCGGACGACT ATGGCCTCAG TGCCGAATCC GCTGCGCAGC AGGTCCGGGA ACTGCAGCCG
AACATCGTTT TCCTGTGTTC ACCCAACAAC CCCACCGGCA CCGGGCTGGG ACTGGATGTG
GTGGAGGCCG TGTATGCGGC AGGCGAGGCC AGCCAGACCG TCGTGATCGT CGATGAGGCT
TACCACGAAT TCGCGCACGA CGGCACGCCC AGCGCCCTCA CGCTTCTTCC AGGCCGTGAG
CGGCTTATCG TCTCCCGAAC CATGAGCAAG GCATTTGCGC TGGCCGGAGC ACGCCTGGGC
TACATGGCTG CCGCGCCCGA GGTTGCGGAT GCACTGCGGC TGGTGCGGCT GCCGTACCAC
CTGTCCGCTA TCACCCAGGC CACTGCGCTG GCTGCCCTGA CCCACCGCGA GGCACTGATG
GCCGACGTCG AAGACATCAA GCTGCAGCGC GACCGGATTG TCACGGAACT GACCAGAATG
GGCCTCAAGC CTGCCGCGTC CGACTCCAAC TACGTTTTCT TTGGCGGCCT GGAGAACCCG
CACGAGGTCT GGCAGGGGCT GCTCGACCGC GGGGTCCTGA TCCGGGACGT TGGGATCCCC
GGGCACTTGC GCGTCACGGC AGGCACTGAG ACGGAAACCA CAGCCTTCCT GGAAGCCCTT
GAACTGATCC TGACCGGCCA GCCCAGCGTC CCGGCCTAA
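
A GC content figure such as the one reported in the gene information can be recomputed from the sequence block above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases and that the spaces and line breaks in the block are layout only. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence above, pasted with its spacing intact.
    first_line = "GTGACTGACC AGCTAGAGCG CCTGAACCGA CTTCCCCTCC GGACCAACCT GCGTGGCCTC"
    print(round(gc_percent(first_line), 1))
```

Passing the full 1119 bp block gives the value for the whole gene. The record reports a rounded percentage, so a small difference after the decimal point is expected.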
 
Protein sequence
MTDQLERLNR LPLRTNLRGL TPYGAPQLDV PILLNVNENT HGVPADVLVA ISEAVTAAAA 
GLNRYPDREF TELRERLAEY LGHGLGAENI WAANGSNEVL QQILQAFGGP GRTALGFPPT
YSMYPLLASG TDTEYIVGQR ADDYGLSAES AAQQVRELQP NIVFLCSPNN PTGTGLGLDV
VEAVYAAGEA SQTVVIVDEA YHEFAHDGTP SALTLLPGRE RLIVSRTMSK AFALAGARLG
YMAAAPEVAD ALRLVRLPYH LSAITQATAL AALTHREALM ADVEDIKLQR DRIVTELTRM
GLKPAASDSN YVFFGGLENP HEVWQGLLDR GVLIRDVGIP GHLRVTAGTE TETTAFLEAL
ELILTGQPSV PA
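
The protein sequence above begins with M although the gene sequence begins with GTG, which is consistent with translation table 11, where GTG can act as a start codon. The sketch below illustrates that relationship. It assumes the standard codon assignments, which table 11 shares with the standard code, and it treats only ATG, GTG and TTG as initiators, a simplification of the full table 11 start set. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these common bacterial initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an initiator first codon as Met."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as methionine
    protein = "".join(residues)
    # Drop a single trailing stop; internal stops stay visible as '*'.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGACTGACC AGCTAGAGCG CCTGAACCGA"))
```

Applied to the first 30 bases, this gives MTDQLERLNR, the first ten residues of the protein sequence. An internal GTG is still read as valine, since only the first codon is treated as an initiator.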