Gene Namu_4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4166 
Symbol 
ID8449792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4602534 
End bp4603814 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content76% 
IMG OID645043215 
Producthypothetical protein 
Protein accessionYP_003203444 
Protein GI258654288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00815938 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG ACCTGCATGC GCCGGACGAA CTGATCGCCG CCGACGGGGC GCAGTTGCTG 
CCCTCGGCCG CGCTGGCCGG CGCCCAGGTG CGGTCCACAG CCGAACAGGT GGCCGGGCTG
AGCGACTTCG GTCGGCCCCG CGCCCTGGTC GTCGTCGGGG CGAGCGCCGC GATCGACACC
GCCCTGCTGC TCGCCCTGCT CGGGGATCCG CCGACCCCGG TGGTCACCGG TACGACGCTG
CCGTCGTGGG TGGGCCCGCT GGACATCGTG GTGGTGCTGG CCTCCGTCGT CGACGACATC
GCCTGCGCCG AAGCCGCTGC CCTGGCCGCG CGACGCGGCG CCCGGGTGAT CGTCCGGGCG
GCCGCCGACG GACCGGTGGC CCAGGCCGCC GCCGGGTCGC TGCTGATCGC GCCCGCGGTC
ACGGTGACCG AGGCCCTGGC CGGCGTCGCC CGGTTGACCG TCCTGGTCGC CGTGGCCGCC
GCGGCGGGTT TCGGGCGCCG TCCCGATTTC GCCACGGCCG CCGACCAGCT CGACGCCCTG
GCGATGGCCT GCCACCCGTC ATCGGAGTTC TTCGTGAACC CGGCCCTGAC CCTGGCCGAA
CACCTGGCCG ACGGGACTGC ACTGATCATC GGCACCGATC CGGTGGCCGA TGCCCTGGCC
GCCTACGCGG GCCGGGCGCT GATCGCGCTG GCCGGGCAGG CCGGCGCCGC CCTGCCGTCC
TACCTGGCGG CCGGCTCACC CCCGATCCTG GCCAGCGCCG CGCGCACGGA CGGCCCGACC
GGAACCTTCT ACGACCCCTT CGCCGACGGC GACGGGGACA GCGCCGGACA GCGGCTGAGC
AGTGTGCTGG TGATCGGCCC GTCGGCCGCC GGCCTGCCCC CGGGCGCGGG CCTGCTCCCG
CTGGAGTCGC TGCCCTCCTT CGGCAGCGAA CCGGTCGATC CCACGCGGGT CCCCGCGGCC
CCGCAATCGC CCCTGGCCGC GGCGCTGCAG GCGGCCCTGC CGCGGGCCAT CGTCATCGGG
CCCGAGGAAG CCCCGACCGA CGACACGGCC CCGCCCGGGG AGGCGACCGG CCCGGCACCC
GCCCGCGATG CGTTCGGCTG GACCCTGTCC ATGATGGCCC GGATCGACTT CGCAGCCGTC
TACCTTGGAC TGCGGTCGGG CACCCGCCCG CCGATGGACA GCCCGGACGG GCTGGGTCGT
CCCGGCAGTG CCGCGTTGCA CCTGCCGTCG ACGGGTGGCC GCGCCGCGTG GAGCGAGAGG
GAGTCCGGTT CGTGGAGCTG A
 
Protein sequence
MTADLHAPDE LIAADGAQLL PSAALAGAQV RSTAEQVAGL SDFGRPRALV VVGASAAIDT 
ALLLALLGDP PTPVVTGTTL PSWVGPLDIV VVLASVVDDI ACAEAAALAA RRGARVIVRA
AADGPVAQAA AGSLLIAPAV TVTEALAGVA RLTVLVAVAA AAGFGRRPDF ATAADQLDAL
AMACHPSSEF FVNPALTLAE HLADGTALII GTDPVADALA AYAGRALIAL AGQAGAALPS
YLAAGSPPIL ASAARTDGPT GTFYDPFADG DGDSAGQRLS SVLVIGPSAA GLPPGAGLLP
LESLPSFGSE PVDPTRVPAA PQSPLAAALQ AALPRAIVIG PEEAPTDDTA PPGEATGPAP
ARDAFGWTLS MMARIDFAAV YLGLRSGTRP PMDSPDGLGR PGSAALHLPS TGGRAAWSER
ESGSWS