Gene Namu_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4226 
Symbol 
ID8449852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4678090 
End bp4681041 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content74% 
IMG OID645043275 
Productconserved repeat domain protein 
Protein accessionYP_003203504 
Protein GI258654348 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.405356 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTGC CGATCGTGTC CGCACCGGTG GCCGCGGCCG CGCCGGGCCC GGTGTACGGG 
ACATTTACCG ACTTGGCCGG GACCGGCGGC GCCTACCGAG GGACGATGAC GCTGGCCCCG
GGCTTCCCCG CGGCCACGTT CGCCTCCACC TCTCGTTCGG GCGGAGTCGG CCCGCAGTCC
GGGACGACCG CCTGGTTGCC CGCCGGCAGC CCGCCCGGCG TCGTCTACGG ATCAAGCCAG
AACCAGGCCT ACCTCAACCT GCGACCGGCA GCCGACAGCG CCGCCAGCCC GTCCGAGACC
GTCTACACCT TCGCCCGGCC CACCCCGACG GCCGGCTGGA GCTTCATCCT CGGCGACATC
GACGCCGACC AGGTGACGGT CAGCGCGACC ACCGCCGACG GCAGCGCCGT CCCGGTCGCG
GCGTTGGGCT TTGCCGGAGC CTTCAACTAC TGCGACGCAT CGCCCCGGCC GAGCTCGTGC
TCCGGGGTCG GCGCGCCGTA CGACCTGCCC AGTTGGGATC CGGGCACGGC CACGCTGACC
GGCAACTCGG GGGCCAGCGA CACCACCGGC GCCGCCGGTT GGTTCAGCCC GACGGTGCCG
ATCAAGACCC TGACGTTCAG CTACCGGTGG CGCAGCGGCG TCCCGGTGTA CCAGACCTGG
TTCGCCACCC AGACCCGGTC GGTCTCGGGC ACCGTGACCG CCGGCGGCAG CGGGCTGGCC
GGGGTCACCG TCGAGATCGT CGACGGCGGC GGCGCCGTGG TCGGCACCGT CACCACCGGG
GCCGACGGCA CCTACGGCCG GGACGGCCTG GCCCCCGGCA CGTACACCGT CCGGGTCGTC
ACCCCCGACG GGTACGCGCC GGTGGGGCCG AGCCGGCGCC CCGCCGACCT GAGCGCCGGC
GACGCGACCA CCGTCGACTT CGCGCTCGCC CAGGTCGCCG ATCTCTCGGT CATCAAGAAG
CTGGACACCG ATCCGGTCGT CGCCGGCGAG CCCATCACCT ACACGCTGAC CGCCACCAAC
GCCGGCCCGG CGGACGCCAC CGGGGTCAGC GTGGTCGATA CCGTGCCGCC CGGGCTGACC
GGGGTGAGCG GCCAGGTGAC CGGCGGTGCC GCCTGCGTGG TCGACGCCGC CCTGCTCACC
TGCCCGGTCG GTGCACTGAT CGTCGGGACG TCGGCCACCG TGCAGGTCAC CGGCACTGTG
TCGGCGACCG CCCCGGCGGG CGTCGCGCTG CTCAACCAGG CGGCCGTGAC CGCGGACCAG
CCGGACCCGA ACCCCGGCAA CAACCGCGCG TCCGCGGCGG CACGGGTCAC CGCGGCGGCC
GACCTGGTCC TGGTCAAGAC GTTCACCCCG GACAACCCGG TCGCCGGCGG GACGGTCAGC
TACCAGCTCA CCGTCACCAA CAACGGACCG TCGCGGGCCA CCGGGCTGGC CATCGCCGAC
CCGCTGGACC CGGGGGTGAC CGTCGGCACC GTCACCACGA CCGACGGGAC ATGCACCGCG
CCCGGTGGGA TCGTCGCCTG CACCGTGCCG GCCCTGGACG TGGGCGACAG CGTGACGGTC
ACGGTGCCGG TCACCCTGCC CACCGGGTCC ACCCCGGCGC TGCAGAACGC GGCGTCGGTC
ACGGCCGTCA CGCCCGACCC GGATCTGGAA AACAACACCG GGGTGGCGAC CTTCGAACCC
AGCCTCGGGG CCAACCTGGC CCTGATCAAG ACGGCCTCAC CGGCGACCGC CATCCCCGGG
CAGACGATCC AATACCAGCT GTCCGTCTCC AACGGTGGCC CCTCCGACGC CCCGAACGTG
CTGCTCACCG ATTCGATCCC GCTGGGCCTG GACGCGGTCA CGGTGACCGA CGCCGGCGGC
GCCAGCTGCA CCGTCACCGA CCAGGTGAGC TGCAGCTGGG CCTCGGTGCC GGTCGAGGCG
ACCCGGACCA TGACCCTGAC CGGGATCGTC GCCCCGGACG CTCCCGACGG CGCCCTGACC
AATACTGCCG CGGTCACCGC ACCGGTCGAC GAGTCCGACC CCAGCGACAA CACCGCCACC
ACGTCGGTGC TCATCACCTC CGCGGCCGAC GTCAGCCTGA CCAAGACCGC CGGCCCCGAC
CCGGTGGCCC CCGGTGGCAC CGTGACCTTC ACCCTGACCG TGGGCAACGC CGGCCCCCAG
CAGTCCGCGT TGCTGGAGCT GCGCGACCCG ACCCCGGCCG GACTGAGCAT CACCGCCGTC
GACGACCCGG ATTGCCTGAC CGATGCCGTC GCGGTGACCT GCCTGATCGC CGGCCTGGAC
CCCGACGCCA GCCGCACGGT GACGATCACC GGCACGCTGT CGCCGGACTA CGACGGCGAC
GAGCTGACCA ACACCGCGCA GGTGGCCTCG CTGCTCACCG TCGACCCGGA CCCGGCCGAC
AACTCGGCCA CGGCCACCGT CGCGGTGATC ACCCCCGAGC CGCCCGGCTC GAACCTGACC
GTGAGCAAGA CGGCGACCAC GCCGACGGTT GGTCAGGGCG ACCCGGCCGG GTTCGTGGTC
ACCCTGACCA ACCAGGGACC GGCCGACCAG ACCGACGTGG TCATCGCCGA CACCGCCGGC
GACGGCCTGG TGATCGGCTC GGCCACCGGC TCCGCGGGCA CCTGGGACGG CGCCGCGGGC
CTGTGGACCG TGCCGTCCCT GGCCGCCGGG GCCAGCGCCA CCCTCACCGT GTCCGCGACC
GCGACCGCGG TCGGCACCCT GACCAACACG GCGACCCTGA TCAGCTCCGG CCGGCCCGAC
ACCGACCCGG CCGACAACTC CGCGAGCGCC ACGGTCCAGG TCAACCCGAC GGCCGACCTG
TCCCTGACCA AGTCGGTGAC GCCGGCCGGC GGGGCCCCCG GGCAGCCGGT CACCTACCAG
CTCACCGCGA CCAACGCCGG CCCGTCTCCG GCCAGCGGGT GCACGTGGTG GACACGTTGC
CGGCCGGGGT GA
 
Protein sequence
MALPIVSAPV AAAAPGPVYG TFTDLAGTGG AYRGTMTLAP GFPAATFAST SRSGGVGPQS 
GTTAWLPAGS PPGVVYGSSQ NQAYLNLRPA ADSAASPSET VYTFARPTPT AGWSFILGDI
DADQVTVSAT TADGSAVPVA ALGFAGAFNY CDASPRPSSC SGVGAPYDLP SWDPGTATLT
GNSGASDTTG AAGWFSPTVP IKTLTFSYRW RSGVPVYQTW FATQTRSVSG TVTAGGSGLA
GVTVEIVDGG GAVVGTVTTG ADGTYGRDGL APGTYTVRVV TPDGYAPVGP SRRPADLSAG
DATTVDFALA QVADLSVIKK LDTDPVVAGE PITYTLTATN AGPADATGVS VVDTVPPGLT
GVSGQVTGGA ACVVDAALLT CPVGALIVGT SATVQVTGTV SATAPAGVAL LNQAAVTADQ
PDPNPGNNRA SAAARVTAAA DLVLVKTFTP DNPVAGGTVS YQLTVTNNGP SRATGLAIAD
PLDPGVTVGT VTTTDGTCTA PGGIVACTVP ALDVGDSVTV TVPVTLPTGS TPALQNAASV
TAVTPDPDLE NNTGVATFEP SLGANLALIK TASPATAIPG QTIQYQLSVS NGGPSDAPNV
LLTDSIPLGL DAVTVTDAGG ASCTVTDQVS CSWASVPVEA TRTMTLTGIV APDAPDGALT
NTAAVTAPVD ESDPSDNTAT TSVLITSAAD VSLTKTAGPD PVAPGGTVTF TLTVGNAGPQ
QSALLELRDP TPAGLSITAV DDPDCLTDAV AVTCLIAGLD PDASRTVTIT GTLSPDYDGD
ELTNTAQVAS LLTVDPDPAD NSATATVAVI TPEPPGSNLT VSKTATTPTV GQGDPAGFVV
TLTNQGPADQ TDVVIADTAG DGLVIGSATG SAGTWDGAAG LWTVPSLAAG ASATLTVSAT
ATAVGTLTNT ATLISSGRPD TDPADNSASA TVQVNPTADL SLTKSVTPAG GAPGQPVTYQ
LTATNAGPSP ASGCTWWTRC RPG