Gene Namu_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1049 
Symbol 
ID8446645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1158713 
End bp1160674 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content72% 
IMG OID645040187 
Producthypothetical protein 
Protein accessionYP_003200446 
Protein GI258651290 
COG category 
COG ID 
TIGRFAM ID[TIGR02243] conserved hypothetical protein, phage tail-like region 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGC CCGCACCGAA TCTGGACGAC CGTACCTTCC AGGACATCGT CGACGAGGCC 
AAACGGCTGA TCCCCCGGTA CACGCCGGAA TGGACCAACC ACAACCTGTC CGACCCCGGG
GTCGCCCTCA TCGAGCTGTT CGCCTGGATG AGCGAGATGG TGCTGTACCG GGTCAATCAG
GTGCCGGACC GGCTGTACGT GCACTTTCTG AACCTGGTCG GGATCGACCC CTTCCCGCCA
TCGGTGGCCC GGGCGGACGT CACGTTCTGG CTGTCCGCGG CCCAGGACGC GGTGGTCACC
GTGCCCGAGG GCACCCAGAT CACCACCGCC CGGGACACCA TGGCCGAACG GGCGATCGTG
TTCACCACGG TGGACCGGCT GGACATCCGG CCGCCCGAGC TGGTCGCCGC GATCACCACC
GACGCGCGGA CCGAACGGCT CACCGACGTG ATCGACGATC TGCGCTACGA GGGCTCGTCG
GTGACCTGCT TCAGCACGGT CGACCGGACC GGAGCCCTCG TGCCGGGCGA CGCGCTGCTG
CTCGGTTTCG CCCGGTCGCT GGCCGGGATG GCCATCCGGC TGTTCGTCTC GGCGGTCGCC
AAGGGCATCG GGGTGGACCC GCAGCGTCCA CCGCTGGCCT GGGAGGTGTG GAACGGCGAG
GCCTGGATCG CGGTGGACGT GTTCACCGAC ACCACCGGCG GGCTCAACCG CTCCGGCGAG
ATCGTGCTGC TCGTGCCCGG CGAGCACGAG TCGCTGACCC TGGGCGAGAC CAGCTCCTAC
TGGTTGCGGG TGCGGCTGAT CCCGGCCCGG GCCGGCCAGC CCACCTACCA GGAGGCGCCG
CGGATCGACG ACCTGCGCGC GGAGGCCATC GGGGCGACCG TGCGGGCCGA GCACGCCTCG
CCATCGCCCG CGGAGGTCCT CGGGCGTTCC GACGGCAGCC CGGGCCAGGA GTACCGGGTC
AGCTTCCCGC CGATCCTGCC CCGCCGCGCC GGCGAAGGCG TGCGGGTGAC CGACACCGGC
GGGTCGGTGG AATGGACCGA GGTGGAGGAC TTCTCCCGGT CCGGGCCGGG CGACCGGCAC
TTCGTCTGGG ATTCCGCCTC CGGCGAGGTC CGGTTCGGGC CGCGGATCCG GTACGCCGAC
GGATCGGTCC GCCAGCACGG CATGATCCCC CGGGACGGTG CCGAGATCGC CGTCACCGGC
TACCGTTTCG GCGGCGGGGC GGCCGGCAAC GTGGGGGCCC GGACGCTCAC CGCGATGCGT
ACGTCGGTGC CGTTCGTGTC CGGCTGCGTG AACCTGCGGG CGGCCACCGG TGGGGTCGAC
GCGGAGACCG TGGCCGAGGC CAAGGCCCGC GGCCCGATGA CCCTGCGCAC CGGCCAACGC
GCCGTCACCG CCGGCGATTT CGAGCGGCTG GCGCTGGAGT CCTCGGTCGA GGTAGCCCGG
GCCCGCTGCC TGCCGTCGGC GACCGGGCGG GGCCCGGTGC GGCTGCTGGT GGTGCCGGCC
GTGCGCACCG ATCCCAAGGC GCAGCAGCTC GACGACTACG CGCTGGCCGC CCCGCTGATG
CGCACGATTA CCGATCACCT GGACCGGCAC CGCATCGTCG GCACCGCCAT CGAGGTGGGA
ACCCCGTACT ACCAGGGGGT GTCGGTGGCC GCGCTGGTCC ACGCGCCGCC CGGACGGCCG
CTGGCCCTGG TCCGCCAGCG GGCCATCGAC GAGCTGACCC GCTACATCAA TCCGCTGACC
GGCGGCGCGG ACGGGGCCGG CTGGTTGTTC GACGTCGACC TGAACGCGGC CGCCATCGCC
CAACTACTGG AGACCGTCGA GGGGGTCGAG CGGGTCGATG AGGTGCAGCT GTTCGAGTTC
GACCTGCGCA CCCGTCAGCG GGTCGGCTCC GGCCGCGACG TCATCCGGCT GGACCGGCAC
TCGCTGTTCC TGTCCGGGAA CCACCGGGTC GTCGTGCGAT GA
 
Protein sequence
MALPAPNLDD RTFQDIVDEA KRLIPRYTPE WTNHNLSDPG VALIELFAWM SEMVLYRVNQ 
VPDRLYVHFL NLVGIDPFPP SVARADVTFW LSAAQDAVVT VPEGTQITTA RDTMAERAIV
FTTVDRLDIR PPELVAAITT DARTERLTDV IDDLRYEGSS VTCFSTVDRT GALVPGDALL
LGFARSLAGM AIRLFVSAVA KGIGVDPQRP PLAWEVWNGE AWIAVDVFTD TTGGLNRSGE
IVLLVPGEHE SLTLGETSSY WLRVRLIPAR AGQPTYQEAP RIDDLRAEAI GATVRAEHAS
PSPAEVLGRS DGSPGQEYRV SFPPILPRRA GEGVRVTDTG GSVEWTEVED FSRSGPGDRH
FVWDSASGEV RFGPRIRYAD GSVRQHGMIP RDGAEIAVTG YRFGGGAAGN VGARTLTAMR
TSVPFVSGCV NLRAATGGVD AETVAEAKAR GPMTLRTGQR AVTAGDFERL ALESSVEVAR
ARCLPSATGR GPVRLLVVPA VRTDPKAQQL DDYALAAPLM RTITDHLDRH RIVGTAIEVG
TPYYQGVSVA ALVHAPPGRP LALVRQRAID ELTRYINPLT GGADGAGWLF DVDLNAAAIA
QLLETVEGVE RVDEVQLFEF DLRTRQRVGS GRDVIRLDRH SLFLSGNHRV VVR