Gene Namu_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3858 
Symbol 
ID8449477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4228965 
End bp4230014 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content74% 
IMG OID645042906 
Productcobalamin synthesis CobW domain protein 
Protein accessionYP_003203142 
Protein GI258653986 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0342119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCCG TGGCGATTCT GGCGACCGTC GACCCCGTGC TGCGCGACGC CGCTCTGCTG 
TCCCTGCTCA CTGATCTGCC CGGCACCGGG GTGCTGGCCC AGGACCTGGA TCCCGACACC
GGCACGTTGC GCCGGATCGT CAGCGACCAG CACGGCATCG CCGAGGACAG CACCCGGCCG
TTGGCGCACG CCTGCCTGGG CTGCGCGATC CGGGAGGACT CGGTGCCCAC GTTGGAGTCC
ATGGCCGCGG CCAGGCGGTG GGAGCGGATC ATCTGGGCGC TCCCGGTCTC GGCCGAGACC
GCCCCGGCGG CCCGGCCGCT GTGCCGACCG GATGCCGTGC CCGGCCTGGA GCTGGCCACC
GTCGCCTGCG TCGTCGACGC CGACCAGGTC GAGGCCGATC TGATGGGGGA CGAGCTGCTG
GCCGACCGGG ATCTGGCCCT GTCGGCCGAC GACCGGCGGT CGGTCGGCGA GGCCTCGGCG
GCCCAGCTCG GGCACGCCGA CCTGGTCCTG ACCATCGGTG AGGACCCGGT CGGGTTGACC
CTGGCCGACC ATCTGCGCGG CCGCCGCACC CTGCGCTCCA CCCTGTTCGG CATCCGCGCC
GAGCAGGTGT TCGCCCCCCG GCACTCGGCC CGGCACGCCG AGGCGCGGAT CGATCCGTGC
CGCATCCAGG CTCCGGATGC CCCGGACGCG CATGGGGTCT GGAGCCTGGA CCTGCTCAGC
CCGCGCCCGG TGCACCCGGG CCGGTTCCTG GCCGGGATCG GTGAGCTGGC CGGTGGCCGC
ACCCGGTCCC GGGGCCGGTT CCACCTGCTC AGCCGGCCGG GACGGGTGGC CGTGTGGGAC
GGGGCCGGGC GTCAGCTGTC CATCGGCGAC GGTGGTCCGT GGCGGGTCGG CACCCCGTCC
ACCCGCATCG TGTTCACCGG GGTGGACGAC GACCGGGCCC GAGTGGCCCA AGGTTTCGCT
CGGATGCTGA TGACCGACGA CGAGCTGGCC GGGTCGATGC GAGTCCGCCA CGAGGACGAC
GGACTGGACG GCTGGCTGGG CGCCCGCTGA
 
Protein sequence
MIPVAILATV DPVLRDAALL SLLTDLPGTG VLAQDLDPDT GTLRRIVSDQ HGIAEDSTRP 
LAHACLGCAI REDSVPTLES MAAARRWERI IWALPVSAET APAARPLCRP DAVPGLELAT
VACVVDADQV EADLMGDELL ADRDLALSAD DRRSVGEASA AQLGHADLVL TIGEDPVGLT
LADHLRGRRT LRSTLFGIRA EQVFAPRHSA RHAEARIDPC RIQAPDAPDA HGVWSLDLLS
PRPVHPGRFL AGIGELAGGR TRSRGRFHLL SRPGRVAVWD GAGRQLSIGD GGPWRVGTPS
TRIVFTGVDD DRARVAQGFA RMLMTDDELA GSMRVRHEDD GLDGWLGAR