Gene Tmel_0120 details

Gene Information

Locus tag: Tmel_0120
Symbol:
ID: 5296554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermosipho melanesiensis BI429
Kingdom: Bacteria
Replicon accession: NC_009616
Strand:
Start bp: 123251
End bp: 124531
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 32%
IMG OID: 640768372
Product: amidohydrolase
Protein accession: YP_001305382
Protein GI: 150020028
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR03314] putative selenium metabolism protein SsnA
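
The length fields in this record agree with the coordinates if the start and end positions are read as inclusive, 1-based values and the gene length is taken to include the stop codon. Both are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus the terminal stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(123251, 124531)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```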


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCTA TCATTAATGC AAACATCTTT GATTATGAAA ACTATATAGA AAATCAATAC 
ATTCTATTTG AAAATCAAAT AATTGAAGTT GGTGCAATGG AAAATTATCC AGGAGCGCTT
TATGAAATTA ACGCAAAAAA CTCCATTGTA ATGCCTGGTT TTGTAGTAGG ACATACACAT
ATCTACTCCA CCTTTGCTCG TGGTATTAAC CTTTCTTTTT CACCAAAAAA TTTCAAAGAT
ATACTAGAAC AACTTTGGTG GAAAATCGAC GCAAAACTTG GAAAAGACGA AATATTCTAT
AGTGCTTTGG TTGCTGGTAT AGAATTTTTA AAATCGGGAG TTACCACTGT TTTTGATCAT
CATGCAAGCG GTGCTTTAAT TCGAAATAGC CTAAACACCC TAAAAGAAGC TTTAATTGAC
AATATAGGTT TAAGGGGGAT ATTTTGTTTT GAAACAAGTG ATAGATTCCC AGTAAGAAAA
TGTATAGATG AAAATCTAGA ATTTTTGCAA AACAACTCAC AAATGTATGC TGGGATTTTT
GGGTTACATG CTTCTTTAAG TTTATCAGAT AAGACACTTA AAACCATAGC AGAAGAATAC
AACGGACCAA TTCATATACA CGTTGCAGAA AGTATTGACG ATGTCGATTA TTCAGTTTCA
AATTACGGAC TTACAGTTGT CGAACGTTTA AACAAATTTG GATTATTAAG AAAAAATTCT
ATATTAGCCC ACTGTGTACA CGTTAGTGAA AAAGAACTAG GATTAATATC AAAAAACAAC
TGTTATGTTG CATTAAATGT CTCATCAAAT ATGAACAATG CAGTAGGGTT ACCAAATTAC
AAAAAGATGA AAACATTTAA TGTAAAAACA ATTGTTGGAA ACGATGGGCT TGGTTTTAAC
TTTGCACGGG AACTCTTAGC ACTATTATTT TCCATGAAAT TAAATGGCCA TTCTCCGTTG
GCATTTAACC TTAATGATCT TAAAAATGTT ATAACCAACA CTTACGAAAT TGCACAATAT
TATTTAAACG TAAAACTTGG CAGGATATTA CCTGGTTATG CTGCTGATTT TGTGATAGTA
CCATACACAC CACCTACACC CATAGACAAG ACCAACGCAT TTTCTCACTT TGTTTATGGA
ATTCTAGATA ATTTTAAACC TTCACATGGA ATAGTAAATG GAAAAATTCT AATGGAAAAC
CACAAAATAA ACCTTCAAGT AAATGAAATA TACAGTATTG CAAGAAAAAT TGCACAAAGA
TTATGGGAAT CACTAATATA A
 
Protein sequence
MKAIINANIF DYENYIENQY ILFENQIIEV GAMENYPGAL YEINAKNSIV MPGFVVGHTH 
IYSTFARGIN LSFSPKNFKD ILEQLWWKID AKLGKDEIFY SALVAGIEFL KSGVTTVFDH
HASGALIRNS LNTLKEALID NIGLRGIFCF ETSDRFPVRK CIDENLEFLQ NNSQMYAGIF
GLHASLSLSD KTLKTIAEEY NGPIHIHVAE SIDDVDYSVS NYGLTVVERL NKFGLLRKNS
ILAHCVHVSE KELGLISKNN CYVALNVSSN MNNAVGLPNY KKMKTFNVKT IVGNDGLGFN
FARELLALLF SMKLNGHSPL AFNLNDLKNV ITNTYEIAQY YLNVKLGRIL PGYAADFVIV
PYTPPTPIDK TNAFSHFVYG ILDNFKPSHG IVNGKILMEN HKINLQVNEI YSIARKIAQR
LWESLI
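
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translation, not the tool that produced this record. It builds the codon table from the NCBI amino-acid string, which is the same for translation tables 1 and 11 (table 11 differs only in its permitted start codons), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (T, C, A, G at each codon position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used in the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGGCTA TCATTAATGC AAACATCTTT"))  # MKAIINANIF
```

Passing the full gene sequence, with its spaces and line breaks, to `translate` should give a string that can be compared with the protein sequence listed above once its spacing is removed.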