Gene TM1040_2225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2225 
Symbol 
ID4078216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2336081 
End bp2337271 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content62% 
IMG OID638007547 
Productkynureninase 
Protein accessionYP_614219 
Protein GI99082065 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3844] Kynureninase 
TIGRFAM ID[TIGR01814] kynureninase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.810346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACC TGCCCAAAAA ATACCTCTTC GATATTCCCG AGGGCATGAT CTATCTCGAT 
GGAAACTCGC TTGGCCCCCT GCCAAAGGGC GCGGCAGAGC GGGCCGCCAA GGTGCTGACC
CAGGAATGGG GCACACAGTT GATCAAGGCC TGGAACACCG CCGACTGGAT GGCGCTGCCG
CAAAAAGTGG GCGATCGTAT CGCGGGGTTC ATCGGCGCAG CACCGGGCAG CGTGGCCACG
GGCGATACGC TTTCGATCAA GGTTTATCAG GCGCTCGCGG CGGCGCTCAA GATGCGCCCC
GAGCGCCGGG TGATCCTGTC GGACACGGGC AATTTTCCGA CCGATCTCTA CATGGCGCAG
GGGCTGATCT CCACCATCGG CAAGGACTAT GAACTGCGCA CCGTTGCCCC CGAAGAGGTC
GCGGATGCGA TCACCGATGA TGTGGCGGTG GTGATGCTGA CGGAGGTGGA CTATCGCTCT
GGCCGCCGTC ACGACATGAT GGAGATGACA GCACGCGCGC ATCAGAACGG CGCGGTGATG
ATCTGGGACC TCGCCCATAG CGCAGGCGCG CTGCCGGTGG ATCTGACGGC CTGCAATGCA
GAATTCGCGG TGGGCTGCAC CTACAAGTAT TTCAACGGGG GACCCGGTGC GCCTGCCTTT
ATCTATGCGC GGCCCGACAT TGTGCTTGAG GTGGACCCTG CGCTTGCGGG CTGGCTTGGT
CATGATGCGC CTTTTGCGAT GGAGCCCGAT TATCGTCCGG CGATGACCAC GGAGCGTCTG
CGCGTTGGCA CGCCCTCGAT TGTGCAGCTC TCGATCCTTG ATACGGCACT GGATGTTTGG
GACGGGGTCT CGATGGAAGA GATCCGCGGC GCGTCCGTGG CCCTGTGCGA GACGTTCATT
GCCGAAGTCG AGGCCCGCTG CCCGGAACTG ACGCTTGCCT CCCCCAGAGA GGCAGCGCTG
CGAGGGTCGC AGGTCTCCTT TGCCTTTGAG GATGGCTATG CGGTGGTACA GGCGTTGATT
GATCGCGGCG TCATCGGCGA TTTCCGCGCG CCCAACATCA TGCGCTTTGG TTTCACACCG
CTCTATCTCG ATCAGGCGGA TGTGGTGCAA GCCGCCGAGA TCCTTGAGGA TGTGATGAAG
CGAGAGAGTT GGAAAGATCC CAAGTATCAG GTGCGCTCGC GCGTGACCTG A
 
Protein sequence
MTNLPKKYLF DIPEGMIYLD GNSLGPLPKG AAERAAKVLT QEWGTQLIKA WNTADWMALP 
QKVGDRIAGF IGAAPGSVAT GDTLSIKVYQ ALAAALKMRP ERRVILSDTG NFPTDLYMAQ
GLISTIGKDY ELRTVAPEEV ADAITDDVAV VMLTEVDYRS GRRHDMMEMT ARAHQNGAVM
IWDLAHSAGA LPVDLTACNA EFAVGCTYKY FNGGPGAPAF IYARPDIVLE VDPALAGWLG
HDAPFAMEPD YRPAMTTERL RVGTPSIVQL SILDTALDVW DGVSMEEIRG ASVALCETFI
AEVEARCPEL TLASPREAAL RGSQVSFAFE DGYAVVQALI DRGVIGDFRA PNIMRFGFTP
LYLDQADVVQ AAEILEDVMK RESWKDPKYQ VRSRVT