Gene Namu_2135 details


Gene Information

Locus tag: Namu_2135
Symbol:
ID: 8447746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2353085
End bp: 2354728
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 73%
IMG OID: 645041258
Product: protein of unknown function DUF1023
Protein accession: YP_003201502
Protein GI: 258652346
COG category:
COG ID:
TIGRFAM ID:
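
The start, end, gene length and protein length fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (which matches the listed 1644 bp) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(2353085, 2354728)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both results agree with the Gene Length and Protein Length fields in the record.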


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.053243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0210416
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCT CCGGCGGGTC GTTGGCCCGC CTGGCCATCG CCGACACGGC CCGGCGCCTG 
GGGCATGCGG ACCCGGTCGC GCTGGTCGAC GAGTTGGCCG AGGTCGACGT GCGGGGGCTG
CGGGCCTGGC ACGAGCTCAG CGCCGCCGCC GGGCAATCGC TGTCGACCGG CGCCGATCGG
CTCGGCCGGG CGGTGTCGGC GGTGGCCCAG GCCTGGTCCA GCCCGGTCCC GCGGGCATCG
GTGGACCTTC ATCGGCAGGC CGCACGGGAG GCCCACGAGG TGATCGGCCG TCAGCTCGAC
GTCGCTCTGG ACACGATCGG CACGCTGGAA TCGACCAGGA TCTCGGCGTC GGCCGAGCTG
GACCGGGCCG AGGACCGCAT CGTGGGCACC GGTTGGCCAC CGAGCGAGGA CCTGCTCACC
TGGGCGACCA CCACCGGCTG CCTGCCGACC ATCGCGGCGA CCATCGGCGG CCTGGCCCAG
ACCGTCCAGC AGCTCGGCCG GCGCAACGAC GCGGCGCTGG AAGCGTTGAC CGCGGCCCTG
CGGGAGGACC CGTCGGCACC GGTGGACACC CTGCGGGCCA TCATGCCGGC GGCGTTTCCC
GGTGCGTCCC CGCCCCGTGA ACCCGCCCGC GCGCCCGTCG ACCAGGACAA CCTGGACCGA
CTCGCCGTCG ACCTCCGGTC CAGCGACATC AGCGTGCTCA TCGCCGCCCG TGGCGTCCAG
GCCGCCCTGG ATCAGGCCCG GACGGCCGGC GGGAACGCCC AGCTGCTGGT CTACGAGTCG
GCCAGCTCGT CCAGCCAGGG CCGGGCGGCG ATCAGCATCG GCGACATCAC CACCGCCGAC
AACGTGGCCG CGCTGGCCCC GGGGGTGGGC AGCTCGCCGA CCTCGATGAT CGAAGGCATC
GACGACGCCG TCGCGCTGCG CGATCGCGCG CAGCAGCTCG AGGGGTCCAG CCGGACCGCG
GTGGTCGCCT GGTACGGCTA CGACGTGCCA CTGGCCGCGC TGGGCGGCTC GCCGATGACC
CCCGGAGCGA CCGTGGCCGA CCTGGCGACC ACGGTGAACG ACATGGCCGC CCGGGCGGGC
GGCGGCCAAC TGGTGCAGGA TCTGGACACC TTCCGGCAAT GGGCGCCGCC GGACGCCCGG
TTCATCGGCA TCGGCTTTTC CATGGGGTCG ACCACGGTGT CGGCCGCGGC CGCGCGCCGC
GCGGGGTTCG ACGACCTGGT CATGCTGGGC TCACCCGGGG CCAGCGTCGA GGTCGAGACG
GCGGACGACT ATCCGGGGAT GACCCCGGAC CACGTGTGGG TGGGTTCGCT GGACAACGAC
CCCATCACCA AGGGCATCAC CGATGGTGCC GCCGAGCTGC TCAACGGGCT CGGTCTCAAT
CCCTTCCAAC CCACCCCGTT CGGACCCGAC CCGGCCGACG CCGACTTCGG GGCCCGGGTG
ATGGACCTGA CCTCGAACGC CCCGGACGTT TCGGTACAGC TGGGCGGACC GTTCGGACTG
CTGACCTCGG CAGCGGCCAA CGAGATGCTC GACCTGCAAC TGAATCATCA GCAGGGCAAT
TACCTGTCGG GACCGAGCTT GGACGCGGTG GCCGCGGTGG TCGTCGGCGA CTACGACGCG
GTACCACTGC GGCCGGGCCG CTGA
 
Protein sequence
MTGSGGSLAR LAIADTARRL GHADPVALVD ELAEVDVRGL RAWHELSAAA GQSLSTGADR 
LGRAVSAVAQ AWSSPVPRAS VDLHRQAARE AHEVIGRQLD VALDTIGTLE STRISASAEL
DRAEDRIVGT GWPPSEDLLT WATTTGCLPT IAATIGGLAQ TVQQLGRRND AALEALTAAL
REDPSAPVDT LRAIMPAAFP GASPPREPAR APVDQDNLDR LAVDLRSSDI SVLIAARGVQ
AALDQARTAG GNAQLLVYES ASSSSQGRAA ISIGDITTAD NVAALAPGVG SSPTSMIEGI
DDAVALRDRA QQLEGSSRTA VVAWYGYDVP LAALGGSPMT PGATVADLAT TVNDMAARAG
GGQLVQDLDT FRQWAPPDAR FIGIGFSMGS TTVSAAAARR AGFDDLVMLG SPGASVEVET
ADDYPGMTPD HVWVGSLDND PITKGITDGA AELLNGLGLN PFQPTPFGPD PADADFGARV
MDLTSNAPDV SVQLGGPFGL LTSAAANEML DLQLNHQQGN YLSGPSLDAV AAVVVGDYDA
VPLRPGR
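
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The sketch below is illustrative only and is not the database's own tooling. It strips the whitespace, computes a GC fraction, and translates codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; the two tables differ only in which start codons are permitted.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon (the stop is not included)."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence as printed in the record.
first_line = clean("ATGACGGGCT CCGGCGGGTC GTTGGCCCGC CTGGCCATCG CCGACACGGC CCGGCGCCTG")
last_line = clean("GTACCACTGC GGCCGGGCCG CTGA")

print(translate(first_line))  # MTGSGGSLARLAIADTARRL
print(translate(last_line))   # VPLRPGR
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` should allow a comparison with the GC content field.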