Gene Namu_1920 details


Gene Information

Locus tag: Namu_1920
Symbol:
ID: 8447527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2113818
End bp: 2114891
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 73%
IMG OID: 645041050
Product: mucin-associated surface protein (MASP)
Protein accession: YP_003201298
Protein GI: 258652142
COG category:
COG ID:
TIGRFAM ID:
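The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
# Hypothetical helpers for sanity-checking the record; not part of the database.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2113818, 2114891)
print(length, protein_length_aa(length))  # prints: 1074 357
```

Both values agree with the Gene Length and Protein Length fields in the record.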


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.023187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000212154
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCAGGA GCACCGGACC GCGGTCCGCC GCACAGGATT TCGCGGCCAA GGCCGGCGAG 
ATCGCGGAAC TGGTTGCCGA CAAGGTGGCC GACGCCACCA GCGAGGCGGC CAGCAAGGCT
GCGCACCTCG CGGGTGGGGC GGCGCACGCC GCGACGCCGT ACGTCGAGAA GGCCGGTGAG
CAGCTCGAGG AAACCGCCGC CAAGCTGGGT CGCCGGGCTC GCCGGGCCGC GAAGAAGACG
GCCAAGAAGT CGGCTGCGGC AACCACCAAG CAGGCCAAGA TCGTGCAGGC CAAGGCGACC
AAGCAGGCCA AGCGGGGCCA GATCAAGGCC GCCGACAAGG CCGCCGAGAT CGCGCACGTC
GCGCAGGAGA AGGGCCAGGC CACCCTGGTG CAGGCGCTGA CCGCGGCCAG CGCGCAGGCG
GCCAAGGCGA GCACCGCCGC CGACAAGGCG GCCAAGAAGA CCGCCAAGGC CGGTAAGAAG
TCGCACAAGC TGCGCAACCT GATCATCATC GGGGCCGTCG CCGGTGGCGG CGCCTACGCC
TTCTCCAAGC TGCGCTCCGG CGGGGCCGCA CCGGCGGACA CCGCGCCGGC GCCGACCTAC
ACCCCGCCGA CCAGGCCGGC CGACAAGCCC GCCGAGAAGG CGGCCGACAA GCTGGCGGAG
GCCAAGGACA AGGCGGCCGA GGCGTTGGGC ACGGCCAAGG ACAAGGCCGC CGACGCGGTC
GACGCCGCTA AGGACAAGGC CGCCGACGTG ATCGACGCGG CCAAGGACAA GGCGGCCGAC
GCGGTCGACG CGGCCAAGGG CGCCGTCGAC AAGGCCGCCG ACACCACCGC GGACAAGGCC
GCCGACGCGG CTGACGCGGT GGCCGACAAG GCTCCCGAGG CCGCCGACAA GGTTGCCGCC
ACCGCCGATC AGGCCGCCGA CAAGGTCGCC GAGTCCGCGG ACAAGGCCGC CGATAAGGCC
TCGGCTTCCG GTCGCAACCT GGCCGACGCG GCCGACAAGG CCCGGGAAGC GGTGAAGAAG
GCCGCGGCGG AGACCGCCGA AGCGGCGAAG AAGAAGGCGG GAGAGACAGC GTGA
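The GC content field in the gene information can, in principle, be recomputed from a sequence laid out like the one above. This is a minimal sketch with a hypothetical helper name; it assumes the sequence contains only A, C, G and T plus the whitespace used to group bases into blocks of ten, and it makes no claim about how the database rounds its figure.

```python
def gc_content_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout: 6 of 8 bases are G or C.
print(gc_content_percent("ATGC GCGC"))  # prints: 75.0
```

Passing the full gene sequence as a single string (line breaks included) works the same way, since all whitespace is discarded before counting.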
 
Protein sequence
MSRSTGPRSA AQDFAAKAGE IAELVADKVA DATSEAASKA AHLAGGAAHA ATPYVEKAGE 
QLEETAAKLG RRARRAAKKT AKKSAAATTK QAKIVQAKAT KQAKRGQIKA ADKAAEIAHV
AQEKGQATLV QALTAASAQA AKASTAADKA AKKTAKAGKK SHKLRNLIII GAVAGGGAYA
FSKLRSGGAA PADTAPAPTY TPPTRPADKP AEKAADKLAE AKDKAAEALG TAKDKAADAV
DAAKDKAADV IDAAKDKAAD AVDAAKGAVD KAADTTADKA ADAADAVADK APEAADKVAA
TADQAADKVA ESADKAADKA SASGRNLADA ADKAREAVKK AAAETAEAAK KKAGETA
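The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is an illustration rather than the database's own method: the function name is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). It does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Amino acids in NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAGCAGGA GCACCGGACC GCGGTCCGCC"))  # prints: MSRSTGPRSA
```

The output matches the first ten residues of the protein sequence, and translating the final line of the gene sequence in the same way ends at the TGA stop codon.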