Gene Namu_5109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5109 
Symbol 
ID8450740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5695898 
End bp5699113 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content71% 
IMG OID645044144 
Productcytochrome P450 
Protein accessionYP_003204368 
Protein GI258655212 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC CCATCTCCGT CGATCAGCTG CCCGGCCCGC ACGGTGTGCC GATCCTGGGC 
AACCTGCTGG ATCTGAACAA CCCGCACCCG ATCGACACGC TGATGGACTG GGCCCGCCAG
TACGGCCCGA TCTACAAGCT GACCGTTCCC GGCACCACCC GGATCGTCGT CTCCGGTGCC
GACCTGATGC CCGACATCTG CGACGACGAA CGGTTCGACA AGCAACTCGG GCCCGGGCTG
GTCGCCGCGC GGGGCACCGG GACGCCGGGG CTGTTCCTGT CCGAGACCAG CGATCCGCTC
TGGCGGCGGG CGCACAACAT CCTGATGGCC CCGTTCAGCC AGTCCAGCAT GCGCGGCTAC
CTGCCGCGGA TGGTCGACAT CGCCGGCCAG CTGATGGACA AGTGGTCGCG GCTCAACCCC
GATGATGAGG TGAACGTGCC GGCCGACATG ACCGCGCTCA CCCTGGACAC GATCGCCCTG
TGCGGGTTCG GCTACCGCTT CAACTCGCTC TACCGGGATA CCCCGCACCC GTTCGTGGCG
GCCATGGTCC GCAACCTGCT GGAGGCGCAG AAGGAGGCCA AGGAACTGCC GCTGCAGCGC
AAGCTCCGGG TGCAGGCCCG CCGGCAGGCC CGCGAGGACA GCGAGTTCCA GATCAACCTG
GTCAAGGGTC TGATCGAGGA CCGCCGGCGG CAGGGCGACG CGGCCGACAA CACCGACCTG
CTCGGCCGGA TGCTCACCGG GGTGGACAAG TCCAGCGGCG AGGGGCTGCC GGACGACAAC
ATCATCGCCC AGTGCATGAC GTTCCTGGTG GCCGGCCACG AGACCACCAG CGGCCTGCTC
TCGTTCGCCA TCAACTACCT GATGAAAAGC CCGCAGTACA TCGACCAGGC GCGGGTGCAG
ATCGACGAGG TGCTCGGGGA CACCGCCGAG CCGACGTACG AGCAGGTGCA CCAGCTGACC
TTCGTCCGGC AGATCCTGGA CGAGTCGCTG CGGCTGTGGC CGACCGCGCC GATGTTCACC
CGGGCGGCCC GGACGGACAC CGTCATCGGC GGGAAGTACC TCGCGCCCAA GGATGTCGGG
ATCTCGGTGC TGCTGCCGAT GTTGCACCGC GACCCCAGCG TGTGGGGGCC GGACGCCGAG
GACTTCAACC CGCATCACTT CGACCCGGAG CGGTTCGCCG CGGTGCCGCC GCTGGCCTAC
CGGCCGTTCG GCACCGGGCT GCGCGCGTGC ATCGGTCGCC AGTTCGCCCT GCAGGAGGCC
ACTTTGGTGC TCGGCATGCT GCTGCAGCGC TTCGACATCA TCGACCACCG CAATTACCAG
CTGCACACCC GCGCGACCCT GACCGTCAAG CCGGAGGACA TGTGGATCCG GCTGCGGCCG
CGGGAGGGCT TCGCCGGAGT GGCGCGGGCG CAGGCCGCGG TGACGGGTGC GGCCGACGAA
GGGCCGGCCG AGGCCGACGC GTCCGCGGTG GCGACGGCGC ACGGCACGCC GTTGCTGGTG
CTGTTCGGCT CCAACCTGGG CACGGCCGAG GGCATCGCCA ACCGGCTGGG ACGGGAGGGC
GCCGACCGTG GGTACGCGGT GACGGTGGCC GCTCTGGACG ACCATGGACC CGACCTGCCG
GCCGAAGGTG CGGTGCTGGT GGTCAGTGCC TCCTACAACG GCGAGGCGCC GGAGAACGCG
GCCGCCTTCG TGGACAAGCT GCGCAGCCGG GCGGTGCCCG ACGGGGCATA CGCGGGCGTG
CGGTTCACGG TGTTCGGCTG CGGCGACACC GACTGGGCGG CGACCTATCA GGCGGTGCCG
ATCCTGCTGG ACGCCGAGCT GGAACGCTTG GGCGGCACCC GGATCCACCC TCGCGGCGCC
GGAGATGCGC AGGCCGACTT CGACGGCCAG TACCGGGCCT GGCACGCCGA CCTGTGGGCG
GATCTGGCCG CCGGGCTGGG CCTGTCGGAG CGGCAGGCGC AGGTGACGGC GGCCGGGCCG
CGGCTGACGA TCGCCACGGT GAACCGCCAG CTGACCAACC CGGTGGTCGT GTCCTACGAC
GCGACGCCGA CCCTGGTCAC CCGCAACGTC GAGCTGACCG CGACCGGGGC GGCCGGGGTG
CGCTCGACCC GGCACGTTGA GGTCGCCCTG CCGGCCGGGC TGAGCTACCG GGCCGGCGAT
CACCTGGGCG TGCTGCCCCG CAACAACCAG GCCCAGATCC GCCGCGTGAT GCGACGTTTC
GGCCTGGACA TGGGCACCTA CGTGACGATC ACGGCGAACA GTGGCACGCA CACCCACCTG
CCGGTCGACG AGCCGTCGCC GTTGCTGGGC GTGCTCGGGG CCTGCGTCGA ACTGCAGGCG
ACGGCCACCC GGGCCGACCT GGAGGTGCTG GCCGAGCACA CCGACGACCC GGCGCAGCAG
GCGGCGTTGC GGGCGCTGAC CGACGACGAG ACCTACCGGA CGCAGGTGCG TGAGCCGAAC
CTGTCGGTGC TCGACCTGCT GGAGCGCTAT CCGGCCTGCG CGCTGCCGTT CCCGGTGTTC
CTGGATCTGC TGCCGGCCCT GGCCCCGCGG TACTACTCCA TCTCCTCCTC GCCGCTGGCC
AGTCCGGACA CGGTGTGCGT GACCGAGGGC GTGCTGGCCG AGCCGGCCCG ATCCGGCGCC
GGCCGGTTCG AGGGGGTCTG CTCGACCTAC CTGGCGTCGA TGGACGCCGG CAGCACGGTG
TTCGTGTTCA CCCGGGAGCC GACCATCCCG TTCCGCCCGC CGGCCGACCC GAGCGTGCCG
ATGATCATGG TGGGGGCCGG GACCGGGCTC GCCCCGTTCC GCGGGTTCCT GCAGGAACGC
GCCGCCCAGG GTGCCGACGG GGCCGCGCTG GCCCCGTCCC TGCTGTTCTT CGGCTGCCGG
ACCCGCGACG ACCGGTTGTA CGAGCAGGAG CTGGCCGACT TCGCGACGAG TGCGAGTGTG
CAGACTTACA CGGCATTCTC GCGCGAGCCG GGCCAGCAGC GGCGCTACGC CCAGCACGAG
ATGCTGGCCC ACGCGGACGA GATCTGGTCG CTGCTCGAGG CCGGCGGGGT GGTGTACGTC
TGCGGCAACG CCCGTACCCT GGCGCCCGGC GTGCGCGCCG CCCTGACCCA GATCGCCGCC
GACAAGCTCG GTCTCGGCGG CGCCGCGGCC GAGGACTGGC TGACCGACCT GCGCCGCCAG
CACCGGTACC TGGAGGACAT CTGGGGTGCC CGCTGA
 
Protein sequence
MTAPISVDQL PGPHGVPILG NLLDLNNPHP IDTLMDWARQ YGPIYKLTVP GTTRIVVSGA 
DLMPDICDDE RFDKQLGPGL VAARGTGTPG LFLSETSDPL WRRAHNILMA PFSQSSMRGY
LPRMVDIAGQ LMDKWSRLNP DDEVNVPADM TALTLDTIAL CGFGYRFNSL YRDTPHPFVA
AMVRNLLEAQ KEAKELPLQR KLRVQARRQA REDSEFQINL VKGLIEDRRR QGDAADNTDL
LGRMLTGVDK SSGEGLPDDN IIAQCMTFLV AGHETTSGLL SFAINYLMKS PQYIDQARVQ
IDEVLGDTAE PTYEQVHQLT FVRQILDESL RLWPTAPMFT RAARTDTVIG GKYLAPKDVG
ISVLLPMLHR DPSVWGPDAE DFNPHHFDPE RFAAVPPLAY RPFGTGLRAC IGRQFALQEA
TLVLGMLLQR FDIIDHRNYQ LHTRATLTVK PEDMWIRLRP REGFAGVARA QAAVTGAADE
GPAEADASAV ATAHGTPLLV LFGSNLGTAE GIANRLGREG ADRGYAVTVA ALDDHGPDLP
AEGAVLVVSA SYNGEAPENA AAFVDKLRSR AVPDGAYAGV RFTVFGCGDT DWAATYQAVP
ILLDAELERL GGTRIHPRGA GDAQADFDGQ YRAWHADLWA DLAAGLGLSE RQAQVTAAGP
RLTIATVNRQ LTNPVVVSYD ATPTLVTRNV ELTATGAAGV RSTRHVEVAL PAGLSYRAGD
HLGVLPRNNQ AQIRRVMRRF GLDMGTYVTI TANSGTHTHL PVDEPSPLLG VLGACVELQA
TATRADLEVL AEHTDDPAQQ AALRALTDDE TYRTQVREPN LSVLDLLERY PACALPFPVF
LDLLPALAPR YYSISSSPLA SPDTVCVTEG VLAEPARSGA GRFEGVCSTY LASMDAGSTV
FVFTREPTIP FRPPADPSVP MIMVGAGTGL APFRGFLQER AAQGADGAAL APSLLFFGCR
TRDDRLYEQE LADFATSASV QTYTAFSREP GQQRRYAQHE MLAHADEIWS LLEAGGVVYV
CGNARTLAPG VRAALTQIAA DKLGLGGAAA EDWLTDLRRQ HRYLEDIWGA R