Gene Noca_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3044 
Symbol 
ID4600161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3240906 
End bp3241994 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content72% 
IMG OID639777650 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_924233 
Protein GI119717268 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCTTCC CACCCCTGCG CGAGGAGCTG CGGGGCATCG AGCCCTACGG CGCGCCGCAG 
CTGGACGTTC CCGTCCAGCT CAACGTCAAC GAGAACCCCT ACGGGCCATC GCCGGCCTGC
GCCGCTGACA TCGCGGCCGC GGTCGCGCTG GCCGCGGGCA CGCTGAACCG CTACCCCGAC
CGCGAGTTCG TCGACCTGCG GATGGCACTG GCGTCGTACC TCGGCCACGG CGTCACCCAC
GAGCAGGTGT GGGCGGCGAA CGGGTCCAAC GAGGTGATGC TCCAGCTGCT CCAGGCGTTC
GGCGGCCCGG GCCGGGTGGC GCTGAGCTTC GCCCCGACGT ACTCCATGTA TCCCGAGTAC
GCCCGCGACA CCGTCACCGA GTGGGTCGTC GGGCACCGCG AGTCCGACTT CGCGCTCGAT
CTCGACCACG CGCACGACCT CGTCAAGGAG CGCCAGCCGA GCGTCGTGCT GCTCCCGAGC
CCGAACAACC CGACCGGCAC CGCGCTGCCG CTCGACGCCG TCACCGCGCT GTGCGAGGCG
GCGGCCGGGA ACGAGCAGCC CGGGGTCGTC GTGGTCGACG AGGCGTACGG CGAGTTCCGC
CGGGCCGGCA CGCCCAGCGC GCTGGAGCTG CTGCCGCGGC ACCGCAACCT GGTGGTGACC
CGCACGATGA GCAAGGCGTT CGCGCTGGCC GGTGCCCGGG TCGGCTACCT GGCGGCGGCG
CCGGAGATCT GCGACGCGAT CCGGGTCGTG CGGCTGCCGT ACCACCTGTC CGCGGTCACC
CAGGCGACCG CGCTCGCGGC GCTACGGCAC GCGCCGGAGC TGCTCGGCAA GGTCGACGAG
CTGCGGGCCG AGCGCGACCG CACGGTCGAC TGGCTGCGCG AGCAGGGCCT GACGGTCGCG
GACACGGATG CGAACTTCGC GCTGTTCGGG ACCTTCGCCG ACCGGCATGC TGTGTGGCAG
GGGTTGCTGG GCCGGGGGGT GCTGATCCGG GAGACCGGCC CGGACGGCTG GCTGCGGGTC
TCGATCGGCA CCGCCGAGGA GATGCAGGCA TTCAAGGACG CACTGACCCA GGTCAGGAAG
GAAATGTGA
 
Protein sequence
MTFPPLREEL RGIEPYGAPQ LDVPVQLNVN ENPYGPSPAC AADIAAAVAL AAGTLNRYPD 
REFVDLRMAL ASYLGHGVTH EQVWAANGSN EVMLQLLQAF GGPGRVALSF APTYSMYPEY
ARDTVTEWVV GHRESDFALD LDHAHDLVKE RQPSVVLLPS PNNPTGTALP LDAVTALCEA
AAGNEQPGVV VVDEAYGEFR RAGTPSALEL LPRHRNLVVT RTMSKAFALA GARVGYLAAA
PEICDAIRVV RLPYHLSAVT QATALAALRH APELLGKVDE LRAERDRTVD WLREQGLTVA
DTDANFALFG TFADRHAVWQ GLLGRGVLIR ETGPDGWLRV SIGTAEEMQA FKDALTQVRK
EM