Gene Namu_4604 details


Gene Information

Locus tag: Namu_4604
Symbol:
ID: 8450232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5124077
End bp: 5125132
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 75%
IMG OID: 645043645
Product: protein of unknown function DUF214
Protein accession: YP_003203872
Protein GI: 258654716
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3127] Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, permease component
TIGRFAM ID:
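
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree under that reading) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(5124077, 5125132)
print(length)                     # 1056
print(protein_length_aa(length))  # 351
```

Both results match the Gene Length (1056 bp) and Protein Length (351 aa) fields.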


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.644197
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCGTCG CCTGGCGCGA CCTGCGGGCT GCCCGCGGCC GGTTCGCGCT CATCGCGGGC 
GTCGTTGCCC TCATCACCGT CCTGGTCGGC TTCCTGACCG GGCTCACCGG CGGCCTGGCC
GCCCAGAACG TCTCGGCCGT GCTCGGCCTG GACGCGACCC GGATCGTCAC CGCCGCCGGG
GCGCAGTCCT TCGCCGACTC GTCGCTGACC GACACCCAGA CCGCCGAATG GACCAGCCGG
GCCGCCCCGG CCCAGGTCAG CCCGCTGGGC ATCAGCCAGC TGCGAGCCGC ACACGGCGCC
ACCAGCACCG GGGTCGCCCT GTTCGGCGGG CCCAGCACCC TGGACCCGGC CGTGCCGGCC
GCCGACGGCA CCGTCTCGCT CTCCGCCCCG GCGGCCGCCG CGCTCGGCGC CGCGGTCGGT
GACCCGGTCG AGATCGCCGG CGCCACCTAC ACCGTCGCGG CCATCACGCC CGACGCCTGG
TACTCGCACA CCCCGGTGGT GTGGACGACC CTGGCCGACT GGCAGCAGAT CAGCCGGACC
CTCGGATCCG GTGGCAGCGC GGCGACGGCG CTGATCGTCC GCGGCGACGC CGACGTCGCG
GCCATCGATA CGGCCACCGG CACCCAGTCC GCCGGACCGC TGCAGGCGCT GCCGCAGATC
GGCGCGTTCC GCTCGGAGAT CGGCTCCCTC GGATTGATCA TCGGACTGCT GCTGGCCATC
TCGGCCCTGG TGGTCGGCGC GTTCTTCGTC GTCTGGGGCA TGCAGCGCCG GGGCGATGTG
GCGATCCTCA AGGCCCTGGG GGCCTCCACC GGCTCGCTGC GCCGGGACAG CATCGGCCAG
GCCGCCGTGG TGCTGGCCCT CGGCGTCGGC GTGGGCACCG CCGTGGTCGC CGCCGTCGGG
TCGGCGATGC CGGCCGCGGT GCCGTTCCTG CTCACCCCGC TGACCATCTT CGGACCGGCC
GCCCTGCTCG TCGTGCTCGG CCTGATCGGC GCGGCCGTCG CGCTGCGCCC GGTCACCGCC
GCCGACCCCC TCACCGCACT CGGGAGCAAC CGATGA
 
Protein sequence
MFVAWRDLRA ARGRFALIAG VVALITVLVG FLTGLTGGLA AQNVSAVLGL DATRIVTAAG 
AQSFADSSLT DTQTAEWTSR AAPAQVSPLG ISQLRAAHGA TSTGVALFGG PSTLDPAVPA
ADGTVSLSAP AAAALGAAVG DPVEIAGATY TVAAITPDAW YSHTPVVWTT LADWQQISRT
LGSGGSAATA LIVRGDADVA AIDTATGTQS AGPLQALPQI GAFRSEIGSL GLIIGLLLAI
SALVVGAFFV VWGMQRRGDV AILKALGAST GSLRRDSIGQ AAVVLALGVG VGTAVVAAVG
SAMPAAVPFL LTPLTIFGPA ALLVVLGLIG AAVALRPVTA ADPLTALGSN R
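
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates the idea in plain Python. It is not the database's own pipeline: the `translate_cds` name is hypothetical, the codon table is the standard one written in the usual compact TCAG ordering, and `START_CODONS` lists only the three most common bacterial start codons as a simplifying assumption (table 11 permits a few more).

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 shares these and
# differs from table 1 only in which codons may initiate translation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common alternative start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and
    stopping at the first in-frame stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base grouping
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGTTCGTCG CCTGGCGCGA CCTGCGGGCT"))  # MFVAWRDLRA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, since the final codon TGA is a stop codon.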