Gene Namu_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1923 
Symbol 
ID8447530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2117764 
End bp2118903 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content74% 
IMG OID645041053 
Productnuclease SbcCD, D subunit 
Protein accessionYP_003201301 
Protein GI258652145 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0918656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000244242 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCCTGC TGCACACCTC GGACTGGCAC CTCGGTCGCA CCTTCCACGG GCAGAATCTG 
CTGCCCGACC AGGAAGCGGT GCTCACCGCG CTCGCCGACC TGGCCGCGGA GCACCGGGTC
GACGCCGTCC TGATCTCCGG CGATCTGTAC GACCGGGCGG TGCCGTCGCC GGAGGCCGTG
CAGACCGCGT CCCGGATCCT GGCCCGGATC CGGGCGGCCG GCATCACCGT CGTCGCGATC
GCCGGGAACC ACGACTCGGC GCCCCGCCTG GGCGCCTTCA CCGACTTCCT GGCCGCCGGC
GGCCTGCACC TGGGCGCCGC GGCCGCCGAC GTCGGCACCC CGGCCGTGCT GCCCGATCCC
GACGGCGACG TCGTCATCTA CCCGATCCCG TTCCTGGAAC CGGATCTGCT GCGCTCCGGA
TGGGCGCTGC CGGCCGGATC CGGGCACGAA GCGGTGCTGG CCCGGGCCAT GGACCTGGTC
CGCGCCGACC TGGCCGCGCG GCCGCCGGGC ACCCGATCGG TGGTGCTGGC CCACGCCTTC
GTCGTCGGTG GCCGCGCCGG CGGATCGGAA CGATCGATCG CGGTCGGCGG GGTGGAGTCG
GTCAGCGCGG ACCTGTTCGC CGGATTCGAC TATGTCGCCC TGGGCCACCT GCATCGCCCC
CAGGTGCTGG CCGACCGGAT CCGCTACTCG GGATCCCCCT TGCCCTACTC GTTCTCCGAA
GCCGATCACG AGAAGGGCGT GTGGCTGGTC GATCTGGACG CCGTCGGCGG GGTCAGCGCG
ACCCGGCTGA CCCTGCCGAC GATCCGCCGG CTGGTCTGCC GGCGCGGCCG CCTGGCCGAG
ATCCTGGACA CCGAGCCCGA TCTGGCCGAT GCTTATCTCT CGGTCGAGCT CACCGATCCG
GTGCGGCCGG TGGACCCGAT GCGGCGGCTG CGGGAGGTCC TGCCCTACAC GCTGGTCGCC
ACCTGGGTCG GCGGTTCCCC GGCGCCGGCC GCGTGGCCGG CCGCCCCCGC GGTCCCGACC
GGCCACGACG ACGCCGACCT GCTGCACGAT TTCGTCCGCG ATGCCTGCGG GCGGCCGGCA
TCGACGGCCG AACGCGACCT GCTGGACGAG GCCCTGCGCG CGTTGCGGAT ACCGGCATGA
 
Protein sequence
MRLLHTSDWH LGRTFHGQNL LPDQEAVLTA LADLAAEHRV DAVLISGDLY DRAVPSPEAV 
QTASRILARI RAAGITVVAI AGNHDSAPRL GAFTDFLAAG GLHLGAAAAD VGTPAVLPDP
DGDVVIYPIP FLEPDLLRSG WALPAGSGHE AVLARAMDLV RADLAARPPG TRSVVLAHAF
VVGGRAGGSE RSIAVGGVES VSADLFAGFD YVALGHLHRP QVLADRIRYS GSPLPYSFSE
ADHEKGVWLV DLDAVGGVSA TRLTLPTIRR LVCRRGRLAE ILDTEPDLAD AYLSVELTDP
VRPVDPMRRL REVLPYTLVA TWVGGSPAPA AWPAAPAVPT GHDDADLLHD FVRDACGRPA
STAERDLLDE ALRALRIPA