Gene Namu_4956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4956 
Symbol 
ID8450587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5534389 
End bp5536014 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content68% 
IMG OID645043994 
Productbiotin carboxylase-like protein 
Protein accessionYP_003204218 
Protein GI258655062 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGC AGAGCACCCC GGGCGGGACG GCTTCCGGGT CGGCGCCCTT CGGCCAGCGG 
CCCCTCAAGG GAATCGCCGA GATCCGACAC TTCTTCCGGA CCAACACCAC TCCGGTGTTC
TTCATCGGAG CGACCCCGTT CAACCTGCTC GGCCTGGACC GGTGGGTGCG CAACTTCGAA
TACGTGACCT ACTACGACGC GTGGGACGGC GCCCATCCGC GGGTGTTCAC GCCGAAGGAC
AAGCCGTACA TCGAGTTCGA GAGCGGCGAG GAGATCAACA ACTACCTGCT GCAGCACCCC
GAGGTGCGGG CCTACATGCA GCGCAACGGC GGCCGACCCA AGGTCGCCAT GGTCTTCTTC
GACGAGGAGA CCGAGCGGAT CTGCGCCGAG CTCGGCTACG ACCTGATCCT GCCCTCGGCC
GCGTTGCGCG AACACCTGGA CTCCAAGATC GTCACCACCC GGCTGGGCAA CGAGGCCGGC
GCGCCCAGCG TGCCGAACAT TCTGACCTCG GTGGACACCT GGGACGAGCT GGTCAAGGAG
TGCACCGCCG CCGGCCTGGG CACCGACGTG GTGCTGCAGA CCCCATACGG CGACTCCGGC
AAGACCACGT TCTTCCTGAC CAGCCAGGCG GACTGGGACA AGAACGCCGA CGACATCGTC
GGCCAGCAGC AGAAGATCAT GAAGCGGATC AACAACCGCG GCATCGCGGT CGAGGCCGTG
CTGACCCGGC ACGGCACCGT GGTCGGGCCG TTCATGTCCG AGCTAGTCGG GCACGCCGAG
CTGACCCCGT ACAAGGGCGG CTGGTGCGGC AACGAGATGT ACCCGGAGGT GCTGACCGGG
CTGCGCCGGG AGAAGGCCAC CCAGCTGGTG CGCCGGCTCG GCGACCGGCT CGCCGGGGAG
GGGTACAAGG GCTTCTTCGA GGTCGACGTG CTGGTCGACC TGGACTCCGA AGAGGTCTAC
CTGGGCGAGC TGAACCCGCG GATCAGCGGG GCGTCCTCGA TCACCAACGT CACCGCCGGC
GCCTACTCGG ACATGCCGCT GTTCCTGTTC CACCTGCTGG AGTTCATGGG CGTCGACTAC
GAGCTGGACG TCGACGAGAT CAACCGCCGC TGGGAGGATC TCGCCTCGGT CGACGTCTGG
TCGCAGATGG TGATCAAGGA GATCTCACCC GAGGTCGAGC TGATCACCCA GGGCGCCCGG
ACCGGGCAGT GGTACTTCGA CGACGACGGC GAGCTGCATT TCCGCCGGGC CGCGCTGGAC
TGGCACCAGC TGCAGCGGGA GAGCGAGTGC TTCTTCCTGC GCATCTTCGG CGCCGGCGAC
TATCGCTGGA AGGGCGCCGA TCTCGGCGTC CTGGTGACCA AGGGCCGCCT GCAGACCGAG
GAGTCCGACG CGGTGACGAC GGCCGAACCC GAGGTCGAGG CCGCGCCACC GGACCTGATC
TCCGGCTCGC AGTCCGACGC GCAGCCGGTC GGCATCGAGT CCGCGCCCCA GCCGGCCGGC
AAGCCCCGGC TGACCGCGCG GGCCCGCCGG CTGATCGACG CGATCCGCGA CCAGTACGCG
GGGGTGCCGG TCGGCGCCGA GTTCGTCGCG CCGGCCACCA GCGTCGGGGT CAAGTCCGGC
GGCTGA
 
Protein sequence
MSEQSTPGGT ASGSAPFGQR PLKGIAEIRH FFRTNTTPVF FIGATPFNLL GLDRWVRNFE 
YVTYYDAWDG AHPRVFTPKD KPYIEFESGE EINNYLLQHP EVRAYMQRNG GRPKVAMVFF
DEETERICAE LGYDLILPSA ALREHLDSKI VTTRLGNEAG APSVPNILTS VDTWDELVKE
CTAAGLGTDV VLQTPYGDSG KTTFFLTSQA DWDKNADDIV GQQQKIMKRI NNRGIAVEAV
LTRHGTVVGP FMSELVGHAE LTPYKGGWCG NEMYPEVLTG LRREKATQLV RRLGDRLAGE
GYKGFFEVDV LVDLDSEEVY LGELNPRISG ASSITNVTAG AYSDMPLFLF HLLEFMGVDY
ELDVDEINRR WEDLASVDVW SQMVIKEISP EVELITQGAR TGQWYFDDDG ELHFRRAALD
WHQLQRESEC FFLRIFGAGD YRWKGADLGV LVTKGRLQTE ESDAVTTAEP EVEAAPPDLI
SGSQSDAQPV GIESAPQPAG KPRLTARARR LIDAIRDQYA GVPVGAEFVA PATSVGVKSG
G