Gene Namu_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2021 
Symbol 
ID8447630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2231539 
End bp2233059 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content77% 
IMG OID645041147 
ProductCarboxylesterase type B 
Protein accessionYP_003201393 
Protein GI258652237 
COG category[I] Lipid transport and metabolism 
COG ID[COG2272] Carboxylesterase type B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000953164 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00956228 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGTGACCG TGACCGCCGG AACCGACGCC GAGCCCGGGT CCGGGCCCCA GGTCCGCACC 
GGCGCCGGGG TGCTCCGGGG CCGGTGGGAC GCCGGCGTGG CTGTCTTCCT GGGCGTCCGG
TACGCCGAGC CACCGGTGGG CACGCTCCGC TTCGCCGCCC CGCATCCGGC CCGTCCGTGG
GCGGGCGTGC GGCCGGCGGT GGCGTTCGGC CCGCCGCCAC CGCAACCCGG TACCGCCGCG
CCGGGAACGG ACTGGCTCAC CCTGAACATC TTCTCGCCCG ACCCGGCGCC GGCCGCCGGC
CTGCCGGTGC TGGTCTGGAT TCCCGGCGGC GGGTACCTGA TCGGCTCGGG CAGCCAACCG
GAGTTCGACG GCGCCGTCCT GGCTACCGGC GGGATGGTCG TGGTGACGGT GAACTACCGG
CTGGGCCTGG AGGGTTTCGG GCTCCTGGAC GGCGCCCCCG CCAACCGGGG TCTGCTGGAC
CAGGTGGCGG CGTTGCAGTG GGTGCACGAG CACATCCGGG CGTTCGGCGG CGACCCGGGC
CGGGTGTGCG TCGTCGGCGA ATCGGCCGGC GGCGGGTCGG TGGCCGCCCT GCTGGCCATG
CCGCGGGCGG CCGGGCTGTT CCGCCGGGCG ATCGCCCAGA GCGTGCCCGG CCCGTTCTTC
TCGGTCGAGC TGGCCGCCGA CATCGCCACC GCCTGGGCCG CCGAACTGGG GGTCCGGCCC
ACGGTCACCG AGCTGGCCGC GATCGACCCC GGCCGCCTGC CGGCCGCCGG CGAGGCCGTC
TCCTCCGCCA TCGGGCGGTG GGCCGACCGA TGGGGACCGA TCTGTTTCCG GCCCATCCCG
ATCGCGCCGG TCGTCGACGG CGACGTCCTG CCGGCCGCCC CGTGGGCGGC CGTGGCCGGT
GGGGCCGGTC GCGACGTGGA CCTGCTCACC GGTCACACCC GCGACGAGCA TCGGCTGTTC
AGCCTGCTCG ACGGGGTTCT GGGCAGCGTG ACCCCGGAGG GGGCCGACGC CGCCCTGCGC
GCGCTGGCGC CGGGCCCGGA CGGTGCCCGC CGCTACCGCG AGGCCTACCC CGGCGCCGGC
CCGGAGCAGC TGTACGAGCT CGTCAACGGC GACTGGCTGT TCCGCATGCC CTCGCTGCAG
CTGGCCCAGG CGCACGCGGC CGCCGGCGGG CGCACCTACC TATACGAGCT GACCTGGCCG
GCTCCGGGCC TGGGCGGAGC GCTGGGCGCC TGCCACGGAT TGGACGTGCC GTTGGTCTTC
GGCACGCTGG ACCGCGGTCA ACCGGCCATG CTCATCGGCG ACCCTCCCCC GGCCGCCGCG
CGCGACCTGT CCGGCTGGAT CCGCCGGGCC TGGACGGCGT TCGCCGCCGA CGGTGACCCG
GGGTGGCCGG CCTTCGACAC CGAACGGTGG CTCACCCAGC TGCTGGACAC CGACCCCACC
GTCACCGGGT ATCCCGAGCG GGTCTCCGCC GAGTTGTGGC GCGAGCACTC CTTCGCGGCC
CTGCCGCTGC TCGGGCGCTG A
 
Protein sequence
MVTVTAGTDA EPGSGPQVRT GAGVLRGRWD AGVAVFLGVR YAEPPVGTLR FAAPHPARPW 
AGVRPAVAFG PPPPQPGTAA PGTDWLTLNI FSPDPAPAAG LPVLVWIPGG GYLIGSGSQP
EFDGAVLATG GMVVVTVNYR LGLEGFGLLD GAPANRGLLD QVAALQWVHE HIRAFGGDPG
RVCVVGESAG GGSVAALLAM PRAAGLFRRA IAQSVPGPFF SVELAADIAT AWAAELGVRP
TVTELAAIDP GRLPAAGEAV SSAIGRWADR WGPICFRPIP IAPVVDGDVL PAAPWAAVAG
GAGRDVDLLT GHTRDEHRLF SLLDGVLGSV TPEGADAALR ALAPGPDGAR RYREAYPGAG
PEQLYELVNG DWLFRMPSLQ LAQAHAAAGG RTYLYELTWP APGLGGALGA CHGLDVPLVF
GTLDRGQPAM LIGDPPPAAA RDLSGWIRRA WTAFAADGDP GWPAFDTERW LTQLLDTDPT
VTGYPERVSA ELWREHSFAA LPLLGR