Gene Namu_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1707 
Symbol 
ID8447309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1870567 
End bp1871862 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content75% 
IMG OID645040833 
Productamidohydrolase 
Protein accessionYP_003201086 
Protein GI258651930 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000668627 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0323622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGGCG CGACGATCGT GGCCGACCCG ACCACTGTGC ACCGCGAGGC CGAGCTGGCC 
TGGTCGGGCG TCACCGGCCG GATCAGCTAC CTCGGCCCGG TGCGTGGCCC GGCCCGCCCG
GGCGACCTGG TCGGGGCCGG GCGGCTGGTG CTGCCCGGGC TGGTCAACGC GCACACCCAC
GCCGGCATGT CGCTGCTGCG CGGCTACAGC AACGAGGAGC CGCTGCACCG CTGGCTCGAG
CGGGTCCGCG CGTTCGAGGT GCGGATGACC CGGGCCGACA TCCGGGCCGG CCTGCGGCTC
TCGCTGGTCG AGATGATCCG CTCCGGCACG GTCGCCTTCG CCGACATGTA CCTGTGGGAT
TCAGGTTTGC TGGCCGACGT GCACGACGCC GGGTTGCGCG TGCTGGCCGC GACCGCGGTC
TTCGGCTACG ACGCCGTCGC CTACCCGGCG GCCAGCCCGC AGACGGGGGC GCAGGTGCTG
GACGGCACCC CGGCGCTGGC CGCCGAGTTC GCCGGGGACG AGCTGGTGCA GGTCACCTTC
GGGCCGCACG CGCCCTACAC CTGCGGGGCG CAGCTGATGG CCGACGTCGC CGGTCGGGCC
GCCCGGCACG ATCTGGCCGT GCACATCCAC CTGTCCGAAA CAGCCCGCGA GCTGGCCGAA
AGCCGGGAGC GGCACGGCTG CTCACCGATC GAGCTGGCCG CCTGGACCGG GCTGCTCGCC
GGGCCGGTGC ACATCGCGCA CGCGGTGCAT CCGGACGATG ACGACACCGC CCGGCTGGCC
GCCCGCGGGG TCACGGTGGC GCACTGCCCG GTCTCCAACC TCAAGCTCGG CGCCGGCATC
GCGCCCGTCC CGCAGTACCT GACCCGGGGC GTCACCGTCG GCCTGGGCAC CGACTCGATG
GCCTCCAACA ACACCGCCGA CCTGTTCGAG GAGATCAAGA CGGCCGCACT GGTGGCCCGC
GGGGTGGCCC AGGACCCGAC CGCGGTCGGG GCGGCGGACG CGCTGCGGAT GGCGACGCAG
GGCGGCGCGC GGGCCTTCGG TGGCCGGTTG TCCGGACGCC TGGCGGTCGG GGAGCCGGCC
GACCTGGTGC TGCTGGACGT GACCGCCGCG CACGCCACCC CGATGCCGGA CCCGGTGGCC
CACCTGGCCT ACGGGGCGCG CGGCGCCGAC GTCACCGACG TGGTGGTCGC CGGCCGGCCC
CTGCTGGTGC AGGGCCGGCT GACCACCCTG GACGAGGACG GGATCCGGGA CGAGGCGAAC
CAGCGGGTCG CCCGGATCCT GGGTCAGGCG GACTAG
 
Protein sequence
MVGATIVADP TTVHREAELA WSGVTGRISY LGPVRGPARP GDLVGAGRLV LPGLVNAHTH 
AGMSLLRGYS NEEPLHRWLE RVRAFEVRMT RADIRAGLRL SLVEMIRSGT VAFADMYLWD
SGLLADVHDA GLRVLAATAV FGYDAVAYPA ASPQTGAQVL DGTPALAAEF AGDELVQVTF
GPHAPYTCGA QLMADVAGRA ARHDLAVHIH LSETARELAE SRERHGCSPI ELAAWTGLLA
GPVHIAHAVH PDDDDTARLA ARGVTVAHCP VSNLKLGAGI APVPQYLTRG VTVGLGTDSM
ASNNTADLFE EIKTAALVAR GVAQDPTAVG AADALRMATQ GGARAFGGRL SGRLAVGEPA
DLVLLDVTAA HATPMPDPVA HLAYGARGAD VTDVVVAGRP LLVQGRLTTL DEDGIRDEAN
QRVARILGQA D