Gene Namu_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1100 
Symbol 
ID8446696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1220234 
End bp1221652 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content70% 
IMG OID645040237 
Producthypothetical protein 
Protein accessionYP_003200496 
Protein GI258651340 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.616137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCA TCGCCACCGG GCCGGCCTCG GCCCACTACG ACACGCACCA CGACGCAGTC 
GTCATCGCGT CACTGTCGAT CACCGAACCG GTGGTCGTCA CCGAGTCGCG CCGGTGGGCC
ACCGGGCACC GCGGCCCCAT CGTCACCCCG GCCGACATGG GCGATGCCGA TCTCGCCGCG
TTCGTCACCC AGGCGATCAC CATCGGCGTG CATGCGATCG CCGGCGCCGG CGGCGTCCAG
GACCAGATCC GCCTCGACAC CCTGGTCGCG CAGGTCGGGG ACCGGACGGC CGAGGCCAGT
AGCAAAGCCG CAGCCGCGAC TGCCGACGCC GTCACGGCCG CCACCTCAGC GATGGAGCGC
GCCTCGACCG AGGCCAGGAA GGCCATCACC GATGCGGGAA TCGCCGCCCG GCGGAGTTTC
TCGGACAACG TGGAGGGCGC CCGCAAGTCG CTGGCTGACG AGGTCGGCCG GCTGCTCGGT
GGTGAGAACC CCGAGCTGTT GGCTCGGCTC GGACCGGTCC TGGACCGCTT CGGTCGGGAC
TTGGACGACC GCGCCACCAA GCAGACCGCC GACCTGATCG AGCGGGTCAC TCGGCAGTTC
GATCCGGCCG ACCCGACCTC GCCGATCGCC AAGCACAATG CCGAGCTCAC CCGGCAACAG
CAGGAGCTCA GCCAGTCTCT GGACAAGAAC CATCGGGAAC TGGAAGCCAA GGTCGAGGAG
CTGACGGCCG CGGTCCGGCA GGCCCACGCC GCCGCGGAGG CAGCCGCCGC CACCGCGAGG
CTCACCACGC TCAAGGGAGG CACCTTCGAG CAGCGAGTCC ACGAGTTAAT GGACGGGATC
GCCGCCGGTC TGGGCGACGA ATACGCCGCC ACGGGCACTC GCCCCGGCGC CGTGCCCCGG
AGCAAGAAGG GTGACGGGGT GCTCGCGGTG GACGGCGGTG CCGTCAACGT CGTCATCGAG
ATGACCGATT CGAAGCGGAC GTCCTGGAAC AGCTACCTGG ACGAGGCCGA ACGCAACCGG
ACGGCGCTGG CGTCGCTGGG TCTGGTCCGC AGCCCCGACC AGCTGGGGGG CCGCACCATC
CAGACGATCA CCGCCCGCCG AATCGTCATG GCCTTCGACC CCGAGTACGA CGACCCGGCC
ATGCTGCGCA CTGTCGTCCA GATGTTGCGG CTGGCGGCAG TCGCCGCGGA TGCGCGGCGG
GAAGACGCGG AGATCGACAC CGCGCGCGAG AAGCTCACCG AAGCAATCGA TCTGCTCGGC
CGCATCGACG AGGTCAAGCG ATTGGCCGGA CTGGTCGGGA GCAACGCGAC CAAGATCGAC
AAGGAGGCGG ACGGACTCCG GTTGGGGCTG GACCGGCTGC TCGGGCAGGC GATGACCGCG
TTGACGGGAG CCTCTGGCAC CGAATCCGCA GCTGCCTGA
 
Protein sequence
MTSIATGPAS AHYDTHHDAV VIASLSITEP VVVTESRRWA TGHRGPIVTP ADMGDADLAA 
FVTQAITIGV HAIAGAGGVQ DQIRLDTLVA QVGDRTAEAS SKAAAATADA VTAATSAMER
ASTEARKAIT DAGIAARRSF SDNVEGARKS LADEVGRLLG GENPELLARL GPVLDRFGRD
LDDRATKQTA DLIERVTRQF DPADPTSPIA KHNAELTRQQ QELSQSLDKN HRELEAKVEE
LTAAVRQAHA AAEAAAATAR LTTLKGGTFE QRVHELMDGI AAGLGDEYAA TGTRPGAVPR
SKKGDGVLAV DGGAVNVVIE MTDSKRTSWN SYLDEAERNR TALASLGLVR SPDQLGGRTI
QTITARRIVM AFDPEYDDPA MLRTVVQMLR LAAVAADARR EDAEIDTARE KLTEAIDLLG
RIDEVKRLAG LVGSNATKID KEADGLRLGL DRLLGQAMTA LTGASGTESA AA