Gene Namu_2233 details

Gene Information

Locus tag: Namu_2233
Symbol:
ID: 8447844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2462589
End bp: 2463620
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 74%
IMG OID: 645041355
Product: hypothetical protein
Protein accession: YP_003201599
Protein GI: 258652443
COG category: [S] Function unknown
COG ID: [COG3595] Uncharacterized conserved protein
TIGRFAM ID:
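
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2462589
end_bp = 2463620

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be the stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.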


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.000729926
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.419962
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACCT TCCTGACCCC TGAACCGGTC ACCATCGAGA TCCGCAACTC CGCCGGCTCC 
GTCCTGATCG ACCTGGCCGA CGTCACCACC AGCACCGTCG ACGTCGTCGC CGGGCCCTCG
CACCCGCTGG GGTTCCTGGA CGACGTCATC CGGGCGGCCA AGGCCCAGTT CGTCGGCGCC
CGCTCGGGCG GGCCGGACGC GCACGCCGAC CACGGGTGGG GCGCCGACGT CCCCACCGAC
GATCCCGCCG AGCGGGTCCG GGTGGATCTG CGCCCGGGCG GCTCCGAGGG CGGGGCCAGC
ACGTTGATCG TGGACACCGA TCCCGCCCGG GACGGCTGGA AGTCCTCCTT CACCGTGCAC
GTGACCGCCC CGGCCGGCTC CGGGGTACGG GTGCAGACGC AATCGGCCTC CGGGGTGGTG
AACGGGATCG CCGACCGGGT CGAGGTGCGC ACCGCCTCCG GCGACGTCCG CGTCGACCAG
GTGCTGGGTC GCTCGGTGGT GCAGACCGCC AGCGGCGACG TGACGATCTC CGACACCGCC
GAGTGTGACG TGCGGACCGC CTCGGGTGAC ATCGAGCTGC GCCGGGTCCG GGCCGAGGCG
CTGGTGCATT CGACCTCCGG CGACATCCGG ATCGACGCGG CCGGCCGCGA CGTCAGCGCC
CGCAGCGTGT CGGGCGACCT GCGGTTGCTC GACGTGACCG CCGGCCGGGC CGAGCTGATC
AGCGTCTCCG GTGACGTCGA GGTTGGCGTG CACGCCGGCA CGCTCGCCGC GATCGATCTG
AACACCGTCT CCGGCAGCAC CGCGAACGAC TTCGTGGTCA GCGCCGCCCC GCCGGCCCCG
GAGACGCCAA CCGTCGCCGA CGCGGCCTAC CTGGCCGATG CCGAGTTCGA CGCCGAGGGC
GGCTCCCGCG TGAGCACCGA TGCCGGTTCG GCGGCCGGAC CGCACGCCGG GACCGACGAG
CCGCTGCTGG ATCTGCGGGT CAAGACCACC TCCGGCGACA TCCGCCTGCA CCGCGCCGCC
GCCTCCCACT GA
 
Protein sequence
MHTFLTPEPV TIEIRNSAGS VLIDLADVTT STVDVVAGPS HPLGFLDDVI RAAKAQFVGA 
RSGGPDAHAD HGWGADVPTD DPAERVRVDL RPGGSEGGAS TLIVDTDPAR DGWKSSFTVH
VTAPAGSGVR VQTQSASGVV NGIADRVEVR TASGDVRVDQ VLGRSVVQTA SGDVTISDTA
ECDVRTASGD IELRRVRAEA LVHSTSGDIR IDAAGRDVSA RSVSGDLRLL DVTAGRAELI
SVSGDVEVGV HAGTLAAIDL NTVSGSTAND FVVSAAPPAP ETPTVADAAY LADAEFDAEG
GSRVSTDAGS AAGPHAGTDE PLLDLRVKTT SGDIRLHRAA ASH
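
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the tool that produced the record: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares for elongation codons, and applies it to the first 60 bases and the last 12 bases of the gene sequence above. The helper name `translate` and the constants are hypothetical.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence, copied from the record.
first_60 = "ATGCACACCT TCCTGACCCC TGAACCGGTC ACCATCGAGA TCCGCAACTC CGCCGGCTCC"
last_12 = "GCCTCCCACT GA"

head = translate(first_60)
tail = translate(last_12)
print(head)
print(tail)
```

The first fragment should reproduce the opening residues of the protein sequence, and the second should end in the stop symbol that follows the final residues.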