Gene Namu_4472 details


Gene Information

Locus tag: Namu_4472
Symbol:
ID: 8450099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4969941
End bp: 4971470
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 58%
IMG OID: 645043517
Product: hypothetical protein
Protein accession: YP_003203745
Protein GI: 258654589
COG category:
COG ID:
TIGRFAM ID:
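
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python. The variable names are illustrative and not part of the record, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 4969941
end_bp = 4971470

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1530
print(protein_length)  # 509
```

Under these assumptions the computed values agree with the listed Gene Length (1530 bp) and Protein Length (509 aa).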


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAGCG GCGTACTCGA CCGCCCTCGT ACCAGCACCG GAACTTGGCA GCCGACCATC 
CTGCGCAAGC ACCATGTAAG CAACGACACG GTGCTGCTGC TGCGGTCCTA TCGGATCACC
ACTTTTGGCA TTCTAATCGT CTTTCTCGCT CTGCAGTTCT TGATCCCCGC GCGGCTGGTC
GTTAGCGGCA TGGGTGCCGC GGGGCGCCCA TCTGTCGCAG TTGGGATCTT GCTGGTCTTC
CTCTGGGCCC TAGCCGCACT GCGTCCAAAA GGGTTGCCTG CAGGTCGGCA GCCGATCCGA
TGGCTAGTCG GCATCTACGT GGCCGTGCAA TTGGCCACAT ATGCGGTCGG ATTCGATCGC
GGACCAACCC AGATCGAGGC CAACAGCGCA GACCGTTGGT TGATATTCAC ATTTGCGATG
GCCGGTGTGG CGCTGGCCGT CTGTGACGGA CTTGTAACAC GACGTCAGCT TGATCTGCTG
CTCCGCGCGA TGGTGGGATT CGCGGCCGTA ATGGCGATCG TCGGAATATT GCAGTACGCC
CGAATTGTAA ATCTCGTCCT CTACATCCGG ATTCCTGGGT TGACGGCCAA CAGTCAATTG
CTCATACAGG GAGCCCGTGG TGACGGTGAT TTCGCGCGGG TGGCCGGAAC AGCGACACAC
TACATAGAAT TTGGCGTAGT TTTGGCCATC ATGCTTCCAC TGGCGTTGCA CTACGCCCTG
TTTTCCAAAC GAGCACGTTG GGCCCGCGTG CTCTCCTGGG TGCAGGTTGG TCTAATCGTC
TCTGCGATTC CAATGTCCAT ATCGCGATCG GCCATGCTAA CCACCGCGGT TGTACTCCTC
CTCATGCTCT TTGTGTGGAA GTGGCGCTTA CGTTACAACG TGATCGTCAT AGGATCGATC
GCACTAGTGA CGTTCCATCT TGTCAACCGC GGCCTATTAG GGACCATCTG GGCGCTCTTC
ACTAATGTCA ACAACGACCC CAGCATTCAG CACCGGCTCT CCGATACGGC CACGGTGGTT
CAGCTTTTTG AATCGCGACC AGTCTTGGGT CGCGGCGCTG GGATGATCAT CCCGGAGCAA
TACTTGCTGC TGGACAACCA ATTCTACGTT ACGTTGCTAG CAAGCGGTGT TGTTGGTGTC
GCCACACTCG CAGCACTCTA CTTGGTTCCC TATTTCCTAG CGCGCAGTAT TCGACTGCGG
ACCCCCTGCG AGGCCGACCG GCACCTCGCG CAAGCGCTGG CTGTAACATT TCCTGCTGCG
ATGTTGGCCG CCGGAACATT CGATGCCTTC TCGTTCGCAA CCTATGTCGG CGTATTCTTT
GTCCTCATAG GGAGCGTGGG GGCGCTATGG AGATTTACCC GTGGCTCCGT GCCAAAGGGC
GCTCCTCAGC CGCTATATTG GTCGCCAGAC GACCGGTTCG TATGCGCGCC GCTCATGGCC
CTTGATCATC CTCGCTGGAG TTCACCGTTC TTGCGGACTT CCGCCGCAAG AACAAAGAGT
GAATCGGAAA AGCAACTAAC GAGAGTGTAA
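
The GC content field reported for this gene is the share of G and C bases in a sequence like the one above. As a hedged illustration, the sketch below shows one way to compute that percentage from the block-formatted text. The function name `gc_content` is hypothetical, and the database may round or count differently.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for 10-base block formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, used only as a short demo input.
first_row = "GTGAGCAGCG GCGTACTCGA CCGCCCTCGT ACCAGCACCG GAACTTGGCA GCCGACCATC"
print(round(gc_content(first_row), 1))
```

Passing the full gene sequence instead of `first_row` gives the whole-gene value to compare against the GC content field.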
 
Protein sequence
MSSGVLDRPR TSTGTWQPTI LRKHHVSNDT VLLLRSYRIT TFGILIVFLA LQFLIPARLV 
VSGMGAAGRP SVAVGILLVF LWALAALRPK GLPAGRQPIR WLVGIYVAVQ LATYAVGFDR
GPTQIEANSA DRWLIFTFAM AGVALAVCDG LVTRRQLDLL LRAMVGFAAV MAIVGILQYA
RIVNLVLYIR IPGLTANSQL LIQGARGDGD FARVAGTATH YIEFGVVLAI MLPLALHYAL
FSKRARWARV LSWVQVGLIV SAIPMSISRS AMLTTAVVLL LMLFVWKWRL RYNVIVIGSI
ALVTFHLVNR GLLGTIWALF TNVNNDPSIQ HRLSDTATVV QLFESRPVLG RGAGMIIPEQ
YLLLDNQFYV TLLASGVVGV ATLAALYLVP YFLARSIRLR TPCEADRHLA QALAVTFPAA
MLAAGTFDAF SFATYVGVFF VLIGSVGALW RFTRGSVPKG APQPLYWSPD DRFVCAPLMA
LDHPRWSSPF LRTSAARTKS ESEKQLTRV
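
The protein sequence above corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates that relationship in Python. It assumes table 11 assigns the same amino acids as the standard code and differs in allowing alternative start codons such as GTG, which are read as methionine at the first position. The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative, and `START_CODONS` lists only a subset of the initiators in table 11.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    # Remove the spaces and line breaks used for 10-base block formatting.
    bases = "".join(seq.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("GTGAGCAGCG GCGTACTCGA"))  # MSSGVL
```

The demo output matches the first six residues of the protein sequence, including the leading M that comes from the GTG start codon.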