Gene TM1040_1981 details

Gene Information

Locus tag: TM1040_1981
Symbol:
ID: 4077165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2084941
End bp: 2086053
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 64%
IMG OID: 638007296
Product: histidinol-phosphate aminotransferase
Protein accession: YP_613975
Protein GI: 99081821
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
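
The start, end, gene length, and protein length fields are mutually consistent, which a little arithmetic confirms. The Python sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not belong to any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2084941, 2086053)
print(length, protein_length(length))  # 1113 370
```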


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.413428
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACA GCCGCCAAAC CCCGCTCGCC CAATCCCTGC CCGCTTCGGT GCCCTTTGTT 
GGCCCCGAGA CCCATGAACG CCAGCGCGGC GCGCCTTTTG TGGCGCGGCT CGGTGCAAAT
GAGAACCTCT TTGGCGTCTC CCCCAAGGCG ATTGCCGCCA TGCAGGCCTC AGCGGCAGAG
ATCTGGAAAT ACGGCGATGC AGAGAGCTAC GAGCTGCGCG CGGCCCTCTC GGCGTTGCAT
GGCATTGCAC CCGAACACAT CATGGTTGGC GAGGGCATCG ACGGCCTCCT GGGCAATCTG
GTGCGGCTTT ATGTCGGCGC CGGCGATGCT GTGGTGACAT CGCTGGGGGC CTATCCGACC
TTCAACTATC ATGTGGCCGG TTTTGGCGGC GACCTTCACA CTGTGCCCTA CAAGGACGAC
CACGAAGACA TCAAAGCCCT GATGGCCAAG GCACATGAGG TGGGCGCAAA GCTGGTCTAT
CTCGCCAATC CTGACAATCC GATGGGCAGT TGGCATCGCG GTGCAGATAT TGTCGCCGCA
CTTGACGACC TGCCCGAAGG CAGCCTCTTG GTGCTGGATG AGGCCTATGT GGAATGCGCG
CCCAAAGGCA CCGCCGCCCC GGTCGATGTG ACCGACCCGC GCGTGATCCG CATGCGCACC
CTCTCCAAGG CCTATGGCAT GGCGGGGGCA CGCGTCGGCT ATGCCATGGG GGCGGTTGAA
GTCATCTCCG CCTTTCACAA GGTCCGCAAT CACTTTGGCA TGAACCGCTG CGCACAGATC
GGCGCAACCG AGGCCATCAA GGATCAGGCA TGGCTGGCTC ATGTGCAGGC CGAGATCGCC
ACCGCACGCG AAGAGATCTC GCGCATCGCT CGCGAAAACG GCCTCACACC GCTGCCTTCC
GCGACCAACT TCGTCGCCAT AGACTGCGGT CGCGATGGCG CCTTTGCCAA GGCGGTCTTG
GAGGCGCTGG TGGCGCGGGA CATCTTTGTG CGCATGCCAT TTGCAGCCCC GCAAAACCGC
TGCATCCGCG TCAGCTGCGG CCCCGAAAGA GAGCGCCGCG CCTTTGCCGA GGCCCTGCCG
CTGGCCCTCA AAGACGCGCA GAACGGCGCC TAA
 
Protein sequence
MTDSRQTPLA QSLPASVPFV GPETHERQRG APFVARLGAN ENLFGVSPKA IAAMQASAAE 
IWKYGDAESY ELRAALSALH GIAPEHIMVG EGIDGLLGNL VRLYVGAGDA VVTSLGAYPT
FNYHVAGFGG DLHTVPYKDD HEDIKALMAK AHEVGAKLVY LANPDNPMGS WHRGADIVAA
LDDLPEGSLL VLDEAYVECA PKGTAAPVDV TDPRVIRMRT LSKAYGMAGA RVGYAMGAVE
VISAFHKVRN HFGMNRCAQI GATEAIKDQA WLAHVQAEIA TAREEISRIA RENGLTPLPS
ATNFVAIDCG RDGAFAKAVL EALVARDIFV RMPFAAPQNR CIRVSCGPER ERRAFAEALP
LALKDAQNGA
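
The gene and protein sequences can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The Python sketch below is one way to do this with only the standard library. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it does not model the alternative start codons of table 11, which is not a problem here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# The first 20 bases of the gene sequence give the first six residues.
print(translate("ATGACAGACA GCCGCCAAAC"))  # MTDSRQ
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` gives two strings that can be compared directly, and `gc_percent` can be compared with the listed GC content.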