Gene Namu_1104 details

Gene Information

Locus tag: Namu_1104
Symbol:
ID: 8446700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1226710
End bp: 1227816
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 645040241
Product: protein of unknown function DUF1016
Protein accession: YP_003200500
Protein GI: 258651344
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
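
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1226710, 1227816)
print(length)                     # 1107
print(protein_length_aa(length))  # 368
```

Both values agree with the Gene Length and Protein Length fields of this record.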


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.41725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCGA ACGACCTCCC AGATCGCACT GGCTTCCCGC CGACCCCGAG CCACGTCGGC 
CTGCCCGGGT GGTACCCAGA ACTGCTGGAC TCGGTCGCCG GTCGGATCAC CGCTGGCCGG
CAGCGGGCGA CCGGGGCGGT CAACCGGGAG CTGGTCCTGA GCTACTGGGC GATCGGCCGG
GACATCCTGG ACCGGCAGGA GCAGGAGGGC TATGGCACCA GGGTCATCGA CCGGCTCTCG
GCCGACCTCA AAGGGCGGTT CCCGGACGCT AAAGGGTTCT CGCCGCGCAA CCTGAAGTAC
ATGCGGAAGT TCGCCGAGGC CTGGCCCGAC CCGGCAGTTG TGCAAGGGAC CCTTGCACAA
CTGCCGTGGT GGTCCCAGAT CGCTCTGATG GAGAAGCTGC ACGACCCTGA GCAGCGGCTT
TGGTACGCCG CCGAGGCCAT TGAAGCGGGC TGGAGCCGGG ACATCCTGGC CCTGCAGATC
GACCTCAAGT TGCACGATCG CAAGGGCCGG GCGATCACCA ACTTCGCCGG CACCATGCCG
CCCGCCGACT CGGACATGGC CCAGCAGGCG ACCAAGGACC CGTACGTGTT CGACTTCCTC
GATCTCACCG AGCGCAGCCG AGAGCGGGAG CTCGAGACCG GGCTGGTAGA GCACGTCGGG
AAGTTCCTGC TCGAACTCGG GCAGGGATTT GCCTTCGTCG GCCGGCAGGT GCGACTTGAG
GTGGACGGCG AAGAGTTCTA CTGCGACCTG CTCTTCTACC ACCTGAAGCT GCGGCGATAC
GTCGTCATCG AACTCAAGGC CGTGAAGTTC GAGCCCGGCT TCCTCGGCCA GTTGGGCATG
TACATGGCTG CGGTCGACGA CCTGCTCGCC CACCCGACGG ACGAGCCGAC CATCGGGTTG
ATGCTCTGCA AGGGCAAGAA CGATGTGGTC GCCGAGTGGG CGCTGCGCGG CTACTCCTCG
CCGATCGGCG TCTCCGACTG GACCACCGCA ATTTCCACCG CGCTGCCGGA CGACCTGGCA
TCGAGCCTGC CCAGCATCGA GGAGATCGAG GCCGAGCTGT CCGATCCGAG TTCGAGCCAG
ACGGACAACA GCGACACCAC GGACTGA
 
Protein sequence
MSANDLPDRT GFPPTPSHVG LPGWYPELLD SVAGRITAGR QRATGAVNRE LVLSYWAIGR 
DILDRQEQEG YGTRVIDRLS ADLKGRFPDA KGFSPRNLKY MRKFAEAWPD PAVVQGTLAQ
LPWWSQIALM EKLHDPEQRL WYAAEAIEAG WSRDILALQI DLKLHDRKGR AITNFAGTMP
PADSDMAQQA TKDPYVFDFL DLTERSRERE LETGLVEHVG KFLLELGQGF AFVGRQVRLE
VDGEEFYCDL LFYHLKLRRY VVIELKAVKF EPGFLGQLGM YMAAVDDLLA HPTDEPTIGL
MLCKGKNDVV AEWALRGYSS PIGVSDWTTA ISTALPDDLA SSLPSIEEIE AELSDPSSSQ
TDNSDTTD
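
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 implies: it uses the standard amino acid assignments but allows alternative initiation codons that are read as methionine. The sketch below illustrates this under stated assumptions: the codon table is built by hand rather than taken from a library, the start codon set is the one listed for NCBI table 11, and `translate_cds` and `gc_fraction` are hypothetical helper names. It is an illustration, not the method the database used.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; table 11 shares them.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of table 11; at the first position they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Strip the spaces and line breaks used in the blocked display."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCCGCGA ACGACCTCCC AGATCGCACT"))  # MSANDLPDRT
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` allows a comparison with the protein sequence and the GC content field listed in this record.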