Gene Namu_5198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5198 
Symbol 
ID8450829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5793913 
End bp5795493 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content76% 
IMG OID645044229 
ProductCHAD domain containing protein 
Protein accessionYP_003204453 
Protein GI258655297 
COG category[S] Function unknown 
COG ID[COG3025] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCGC GTCAGAAGAA GTTCACCCCG GGTTCGGGGG CCGGGCCCGC GACCCCGGCT 
CCCGCCGGAC CCACCCGGCA GCTGGAGATC GAGACCAAGC TGGAGCTGGA CCCGGACGCG
CCGCTGCCCG CCCTGACCAA GCGCAAGCGG CTGGCCGCCG TCGGCATCGC CGGTGCGGCC
GAGCCGATCA GCCATCACCT GGACGCCCTG TATTACGACA CCGACCAGCT GGACCTGCTG
CGGTCCAAGG TCACCCTGCG CCGCCGCACC GGCGGCGCCG ACGCCGGCTG GCACCTCAAG
CTGCCCGCCG TCCAGGGGGC CCGCACCGAG ATCGGCCTGC CGCTGAGCGC CGGCGCGGAG
GGCGTCGTGC CCGAGCAGAT CGCCGCGCTG GTCCTGGGGG CCGCCCGCGG ACGGCCGCTG
GGTCCGGTCG GCCGGATCGT CAACGACCGC GTCGTGCGGC ATCTGCTGGC CGCCGACGGC
ACCGTGCTGA TCGAGGTCGC CGACGACCAC GTCACCGGCA CCGGCCTGGC CGACGGGTAC
CAGGGCACCC AGCGCTGGCG GGAGGTCGAG GTGGAGATCG TCGACGGCAC CCGCGACCAG
CTGGCCGCCA CCGTCGACGT ACTGACGTCC GGCGGCGCCC GCCCGGCCGA CTCACCGTCC
AAGCTGGCCC GGGCCCTGGG CTACCGGCCG GCCGAGCCGC GCCGGGGCAA GACCGCCGGC
GACATGGTGG TCACCGCGCT GGGCCGCCAG CGCGACCGGT TGATCACCGC GGACCGGGCC
ATCCGGGACG GGGACACCGG CGCCGTCCTG GATGCGCGCA CCCTGTGCCG GCGGATCGGT
TCGGCGCTGG CGGTGTTCGC GCCGCTGTTC GACGGCCCCG CGGTCGCGCC GCTGCGCGAG
GCGTTGACCA CCGCCGCCGG GTTGCTGGAC GGCGCCCGCG ACGTGCAGTC CGCCCGGGGC
CGCCTGGTCG AGCAGCTGAC CGAGGAACCC GCCCCCTACC GGGACCGGGC CCGCACCCGG
CTGGAGCAGG CCTGCGATCG GCGGCTGGCC GCCGCGACCG ACCGCGCCCG CGCCTACCTC
GACGGGGCGG ACTACCTGAT GATGCTGCGC ACGCTGGACG AGTTCCTGGC CGCGCCGCTG
CTGACCAAAC GGGCCGGCCG GGCCGCCCCG CGCGAGCTGG CCGCCCTGCT CGGCGCCGGG
TGGCAGCGCC TGCAGGAACT GGCCGACGCG GCCCTGGCCG ACCCGTCCCG CACCGCGCCG
CTGCGCGACG TGCGGGACTG GGCGGCGAGC ATGCGGTACG CGACCGAGCT GACCGTCGGC
CCGCTCGGGC CGGACGCGGC GGCGCTGGCC TCCGCGCTGG AGGAGGTCCA GGAGTCGGTC
GAGGAACACC TGGACGCCCG CGGCGCCGCC GACCTGCTGG CCACCCTGGC CATCGAGGAC
GGCACCGACG GCGTCGCCGG ATTCATCTTC GGCCGCCTGC ACGCGGTCGA GCAGAACCTC
GCGCACGCCG CCGTCGACGA CTTCACCGAC GCCTGGGACC GGATCGAGGA CGGCGAACTG
GTCGCCGGGT TGGGTCACTA G
 
Protein sequence
MAARQKKFTP GSGAGPATPA PAGPTRQLEI ETKLELDPDA PLPALTKRKR LAAVGIAGAA 
EPISHHLDAL YYDTDQLDLL RSKVTLRRRT GGADAGWHLK LPAVQGARTE IGLPLSAGAE
GVVPEQIAAL VLGAARGRPL GPVGRIVNDR VVRHLLAADG TVLIEVADDH VTGTGLADGY
QGTQRWREVE VEIVDGTRDQ LAATVDVLTS GGARPADSPS KLARALGYRP AEPRRGKTAG
DMVVTALGRQ RDRLITADRA IRDGDTGAVL DARTLCRRIG SALAVFAPLF DGPAVAPLRE
ALTTAAGLLD GARDVQSARG RLVEQLTEEP APYRDRARTR LEQACDRRLA AATDRARAYL
DGADYLMMLR TLDEFLAAPL LTKRAGRAAP RELAALLGAG WQRLQELADA ALADPSRTAP
LRDVRDWAAS MRYATELTVG PLGPDAAALA SALEEVQESV EEHLDARGAA DLLATLAIED
GTDGVAGFIF GRLHAVEQNL AHAAVDDFTD AWDRIEDGEL VAGLGH