Gene TM1040_3005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3005 
Symbol 
ID4078035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3173217 
End bp3174740 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content64% 
IMG OID638008334 
Producthistidine ammonia-lyase 
Protein accessionYP_614999 
Protein GI99082845 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.36207 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGA TGATCCCGGG CGCCGTGACG CTCGACACTT TGGAACGGAT CTGGCGTCAC 
GGCACGCCTG CACGTCTGGC CGATAGCGCG CGTGCAGGTG TCGAGGCCGC TGCCGCGATG
GTGGCAGAGG CCGCAGCGGG TGAAGTGCCC GTTTATGGCA TCAACACTGG TTTTGGCAAA
CTTGCGTCGA CCAAGATCGC GCCGGAGGAT ACTGCGACCC TGCAGCGCAA CCTGATCCTG
AGCCATTCTT GCGGTGTGGG CGAGCCGCTG GCCGAAGACA AAACCCGGTT GATGATGGTG
CTGAAGCTGT TGTCGCTGGG TCGGGGCGCC TCCGGGGTGC GCTGGGCTGT CATTGAGCAG
ATCCAAGAGA TGCTGGCACG CGGGGTGACA CCTGTTGTGC CGTCGCAAGG GTCCGTTGGC
GCCTCTGGTG ATCTTGCCCC GCTTGCCCAC ATGACTGCGG CCATGATCGG CGAAGGCGAG
GCAACAATCG ACGGCGTGCG CCTGCCTGGT GCCGAAGCCT TGAGGCGTGC CGGGTTGGAG
CCGATTGTGC TGGGCCCGAA AGAAGGGCTT GGCCTGATAA ATGGCACGCA GTTTTCCACC
GCTTGCGCGC TCACCGGGTT GTTTGAGGCC CTGGAGATGG CGCGGGCCTC CATGGCGATC
GCGTCTTTGA CAACCGACGC TATCATGGGC TCTACCGCGC CTTTGGTGGC GGATATTCAC
AGCTTGCGCG GCCATGCTGG GCAGATGGAG GTCGCGGCAA CGATGCGCGA CATCATGGCG
GGCTCGGAAA TTCGCGAGAG CCACCGTGAG GGCGACACCC GCGTGCAGGA TCCCTATTGC
ATCCGCTGCC AGCCTCAGGT GGTGGGCGCC GCGCTTGATG TGCTGCGCAT GGCGGCGCGC
ACGCTTGAAA TCGAGGCGAA CGCGGTCACC GACAATCCGT TGGTACTGGT GGAGGCGGGG
CAGATCGTCT CCGGGGGCAA CTTCCATGCC GAATATGTGG GCTTTGCGGC AGATCAGATC
GCGCTCGCCG TGGCTGAGAT CGGCGCGATT GCGCAGCGCC GGGTTGCGCT GATGGTGGAT
CCTACCCTGA GCCACGACCT ACCACCGTTC CTGACGCCGA ACCCCGGCCT CAACTCGGGA
TTCATGATTG CCGAAGTCAC GACTGCGGCG CTCATGAGCG AAAACAAACA TCTGGCCAAC
CCCTGCGTTA CGGATTCCAC ACCGACCTCC GCCAACCAAG AGGACCACGT CTCTATGGCG
GCGCACGGTG CGCTGCGGCT GGCGAAAATG AACGCAAACC TGTCGGTGAT CCTTGGGGTC
GAGATGCTTT GCGCGGCGCA GGGGGTCGAG GCGCGCGCGC CGCTCAAGAC CTCTAGCCGC
TTGCAGAACC TGCTCGACAT GCTGCGCGGC GAGATCCCGA GCCTTGGCGA GGACCGCTAT
CTTGCGCCGG AAATCGAAAC CGCCAGCGCG ATGGTGCGGG CAGGCCGCGT GGCGCAGGCC
GCAGGCGTGG AGGTCAGCAC ATGA
 
Protein sequence
MIEMIPGAVT LDTLERIWRH GTPARLADSA RAGVEAAAAM VAEAAAGEVP VYGINTGFGK 
LASTKIAPED TATLQRNLIL SHSCGVGEPL AEDKTRLMMV LKLLSLGRGA SGVRWAVIEQ
IQEMLARGVT PVVPSQGSVG ASGDLAPLAH MTAAMIGEGE ATIDGVRLPG AEALRRAGLE
PIVLGPKEGL GLINGTQFST ACALTGLFEA LEMARASMAI ASLTTDAIMG STAPLVADIH
SLRGHAGQME VAATMRDIMA GSEIRESHRE GDTRVQDPYC IRCQPQVVGA ALDVLRMAAR
TLEIEANAVT DNPLVLVEAG QIVSGGNFHA EYVGFAADQI ALAVAEIGAI AQRRVALMVD
PTLSHDLPPF LTPNPGLNSG FMIAEVTTAA LMSENKHLAN PCVTDSTPTS ANQEDHVSMA
AHGALRLAKM NANLSVILGV EMLCAAQGVE ARAPLKTSSR LQNLLDMLRG EIPSLGEDRY
LAPEIETASA MVRAGRVAQA AGVEVST