Gene TM1040_2584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2584 
Symbol 
ID4077495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2721455 
End bp2722450 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content65% 
IMG OID638007908 
Productaminotransferase 
Protein accessionYP_614578 
Protein GI99082424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000040763 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGAAC GCTCTTCCTT TGCGGCAACG CAAGACCCGG CGCAGGCCTC GGTCTCCGGT 
TCCCCCCGCG ATCATGGCGG CAATCTCGCC GCCGCGATCG CACAGTATGG CGGCAGCCGC
GAAGACTGGA TTGATCTCTC CACAGGGATC AACCCTGCAC CCTACCCTCT GCCGCCATTG
CTGGCCGAAG ACTGGACCGC TCTGCCCGAT CAAGATGCCC AGCAGGCTCT GACGGATGCA
GCGCGCCGGT TCTGGCAGGT ACCCGACAGC GCAGAGATCC TTGCCGCCCC CGGCGCTTCG
GCGCTGATTG CACGGCTCCC TGTCCTGCGC CCGGCACGCA GCGTGCAAAT CACGACGCCC
ACCTACAACG AACACGCCGC TGCATTTGAC AATTGTGGCT GGCAGGTGCT TGAGGTGGGT
CCAACGGACG CGCAAGTTGT GGTTCACCCC AACAATCCCG ACGGCCGCCT GTGGCTGCCC
GAGGAACTCA AGGCACCTTT CTGCATCATT GACGAGAGTT TCTGTGACAT CTGCCCGGAG
CGGAGTCTGA TCAACCGCGC AGCACAGCCG GGCACCGTGG TCCTCAAGAG CTTTGGCAAG
TTCTGGGGGC TCGCGGGGCT GCGGCTCGGG TTTGCCATCG GCGATCCGGA GCTCATCGCC
GGACTGCGCG CAGCCATGGG GCCGTGGGCG GTGTCCGGCC CGGCGCTGCG CGTGGGAACG
CACGCTCTGA ACGATCAGAA CTGGGCCGAG GCCAGCCGCC TTGAGCTTGC GGATGGAGCC
TCGCGCCTTG ACCAGCTGAT GCAGCACGCC GGCGCACAGA CCGTAGGCGG CACCGATCTC
TTCCGGCTTT ATGACGTTGA AGATGCCCGC GCCTGGCAGG AGCGCCTCGC GCGTGCGCAC
ATCTGGAGCC GCATCTTTCC CTATTCAACC CGTTTCCTGC GCCTTGGCCT GCCTCCCGCA
GACCGCTGGA GCCAACTGGA GACCGCCCTC ACATGA
 
Protein sequence
MAERSSFAAT QDPAQASVSG SPRDHGGNLA AAIAQYGGSR EDWIDLSTGI NPAPYPLPPL 
LAEDWTALPD QDAQQALTDA ARRFWQVPDS AEILAAPGAS ALIARLPVLR PARSVQITTP
TYNEHAAAFD NCGWQVLEVG PTDAQVVVHP NNPDGRLWLP EELKAPFCII DESFCDICPE
RSLINRAAQP GTVVLKSFGK FWGLAGLRLG FAIGDPELIA GLRAAMGPWA VSGPALRVGT
HALNDQNWAE ASRLELADGA SRLDQLMQHA GAQTVGGTDL FRLYDVEDAR AWQERLARAH
IWSRIFPYST RFLRLGLPPA DRWSQLETAL T