Gene Namu_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3850 
Symbol 
ID8449469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4221542 
End bp4222501 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content71% 
IMG OID645042899 
ProductGlycosyl hydrolase family 32 domain protein 
Protein accessionYP_003203135 
Protein GI258653979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0183549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.803751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCCGAC TTCCCGATTC CTGGTTGTGG GACTTCTGGC TCGCCCGTGA CGAGCAGACG 
TACCACCTGT TCTTCCTGTA TGCCTCGCGA GCGCTGCACG AGCCCGACCG CCGGCACCTG
CGGGCCGCGG TCGGCCACGC GGTGTCCACC GACCTGGTCC ACTGGGAACG GGTCGCCGAC
GCCATCGTCC GCGACGACGC ACCGGCTTTC GACCACACCG CGACCTGGAC CGGGTCGGTG
GTCCGTGGAC CCGACGGCAG CTGGTCGATG TTCTACACCG GCACCGTCCG CCACGAGGAC
GGGACGCTGC ACCAGCAGGT CGGCCGGGCG GTATCCACCG ACCTGTACCA CTGGCTGAAG
GATCCGCGAA ACCCGTTGGT CAGAGCCGAC TCCCGCTGGT ACGAGACGCT CGGCGGGGCA
GCCCCGTGGG CGGACGAGCA CTGGCGCGAC CCGTGGGTCT TCGCCGACCC GGACGGCGAC
GGCTGGCACA TGCTGATCAC CGGCCGGGCC AACCACGGAC CGCTGGACGA GCGTGGTGTC
GTCGCCCACG CCCGCTCGGC CGATCTGGCC GACTGGCAGG TCGGCCCCCC GCTGTCCGGC
CCGGACGGCG GGTTCGGGCA GATGGAGGTC TTCCAGGTCG AGAACGTCGA CGGCCGTTGG
GTGCTGATCT TCAACTGCCT GGACGGGGAG TTCTCGGCGG CCCGGGCCCG GGCCGGCGGG
CCCGGCGGGA TCTGGGTGGC CGGCGCCGCG TCCGCGTTGG GCCCCTACGA CATCGCCGGG
GCCACCCTGC TCTCGGACGA TCGTTACTAC GTCGGCAAGC TGGTTCGGGA TCCCGACGGG
CACTGGGTGT TGCTGGCCTT CGTCAACAAG GACGAGAACG GCGCGTTCGT CGGGGATCTG
AGCGACCCGA TGCCGGTCGG CTGGGACGCC GACCGGCTGG TGCTGCGGCC CGCCGGGTAG
 
Protein sequence
MFRLPDSWLW DFWLARDEQT YHLFFLYASR ALHEPDRRHL RAAVGHAVST DLVHWERVAD 
AIVRDDAPAF DHTATWTGSV VRGPDGSWSM FYTGTVRHED GTLHQQVGRA VSTDLYHWLK
DPRNPLVRAD SRWYETLGGA APWADEHWRD PWVFADPDGD GWHMLITGRA NHGPLDERGV
VAHARSADLA DWQVGPPLSG PDGGFGQMEV FQVENVDGRW VLIFNCLDGE FSAARARAGG
PGGIWVAGAA SALGPYDIAG ATLLSDDRYY VGKLVRDPDG HWVLLAFVNK DENGAFVGDL
SDPMPVGWDA DRLVLRPAG