Gene Namu_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0473 
Symbol 
ID8446054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp523033 
End bp524625 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID645039607 
Productprotein of unknown function DUF1152 
Protein accessionYP_003199881 
Protein GI258650725 
COG category[S] Function unknown 
COG ID[COG4034] Uncharacterized protein conserved in archaea 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGAGAC CGCTCTACGT CGCCGCCGGT GGTGGCGGCG ACGCGCTCGC CGCCACCCTG 
CTGCACCGCG CCTACGGACC GCCCGGGCCG GCCACGATCG CCACCTTCTC CTGGGACCGG
TTGATCGTGG ATCCGCTGCC GGGGCCGCGC AGCTTCTCCA ACTTCAAGGG CCTCGAGACG
GTCGGTCGGC TTGAGCACGT CGTCACACCG CGCACTCGGC CCATCCCGCC GGCCGGCTCG
ACCCTTCCAC CGTTGGCTCG GGACCTGGTC GGCCCGCTCG CCTCGACCCT CGTCCTTCTG
GATCCCACCG ACGGCGCGGC TGGCCTGCGC GAGCAGTTGG CTGCCAGCAT GCAAGCGATC
GACGCGGATG CGCTGGTTGT CGTCGACGTC GGCGGCGATG TGTTGGCGAC CGGTAAGGAA
GCCGGCCTCC GTAGCCCGCT CGCGGACGCT CTCGTGCTCG CGGCCGCTCG TGGGCTCAGT
CCCGACGCCC GAGTTTGGGT CGCCGGCCCC GGGGTCGACG GCGAGCTCAC CGCCGACGAC
GTCGTCTCGC GAGCGCACTC AATCGGCGGG GTCCCGCTTC CACCCTTCCC CTCCGACGTT
GCGGCACTGG CTCTTCCTAT CCTCCGATGG CACCCCTCCG AGGCAACCGC GCTATTCGTG
GCGGCCGCCC AGGGCGTCCG CGGACTTGTC GACATCAGAT CCGGCGGCAT GCCTGTTCAG
CTTGGCGCGG TCAGCTCCGA CGTCTACGAA TGCGGAGTCG ATAGTGCCTT CGAGGTTTCG
CCGCTCGCGG ATTCGGTCGC CGACTCCAGA ACTCTGCTCG ACGCCGAGCA GAGGGCTATT
GAGATCTGCG GGATCTCGGA GATCAGGTTC GAGGCGCGGA AAGCTGGGGC AGCAAGACTG
CGCGACGGGT TTCCACCCGA TGTGCTCGAT GAAATACGTG CCTATGCTGC GGAAGCGCTA
GGTCAAGGAG TTACCTACGC GACCTTCCGC CGGCTGGCCG AGCTGATCAG AATCCGCGAT
CACGCGTCGA TTCAACGCAA TCTCGGCCTG AACCTTCCCG GCGCTATTGA ATCAACATTG
TGCAATCTGG CGGGTTTGAC GTCATCCGGT TCGGCGGTTT GCCTCCCGGC CCTGCCTCGC
CCGGCCGCTG GCAGGGCTTC GATGGATGAC TTGCCGCGTC ACTCCGTGTC CGTGGCCGGC
ATCATTATCG ACGTCGAGGG CCGAATCCTG GTCGTCAAGC GTCGTGACAA CGGCGAATGG
CAGCCGCCTG GTGGCGTCCT CGAGTTGGAC GAAACGATCG AGGAAGGGCT GCGGCGTGAG
GTCCATGAGG AAACGGGAAT CGACGTCCAC ATCGACCGCC TTACCGGTGT GTACAAGAAC
ATGCGCCTTG GTGTCGTAGC GCTCGTCTTT CGATGTCGAC CGAGCGCTGG CTCGCTCCAG
GCAAGTTCCG AAACAGAGGT GGCTCGTTGG ATGTCCGCAC AAGAAGTCGA GTCCACCTTG
TCGCCTGCAT TCGCCATCCG TGTCCGCGAC GCCATCGGCG AAGCTGCCTT TGTCGCGATT
CGGTATCACG ACGGCACTGG GGACGTTCCC TGA
 
Protein sequence
MERPLYVAAG GGGDALAATL LHRAYGPPGP ATIATFSWDR LIVDPLPGPR SFSNFKGLET 
VGRLEHVVTP RTRPIPPAGS TLPPLARDLV GPLASTLVLL DPTDGAAGLR EQLAASMQAI
DADALVVVDV GGDVLATGKE AGLRSPLADA LVLAAARGLS PDARVWVAGP GVDGELTADD
VVSRAHSIGG VPLPPFPSDV AALALPILRW HPSEATALFV AAAQGVRGLV DIRSGGMPVQ
LGAVSSDVYE CGVDSAFEVS PLADSVADSR TLLDAEQRAI EICGISEIRF EARKAGAARL
RDGFPPDVLD EIRAYAAEAL GQGVTYATFR RLAELIRIRD HASIQRNLGL NLPGAIESTL
CNLAGLTSSG SAVCLPALPR PAAGRASMDD LPRHSVSVAG IIIDVEGRIL VVKRRDNGEW
QPPGGVLELD ETIEEGLRRE VHEETGIDVH IDRLTGVYKN MRLGVVALVF RCRPSAGSLQ
ASSETEVARW MSAQEVESTL SPAFAIRVRD AIGEAAFVAI RYHDGTGDVP