Gene Namu_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1033 
Symbol 
ID8446629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1139606 
End bp1141087 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content69% 
IMG OID645040171 
Productiron-sulfur cluster binding protein 
Protein accessionYP_003200430 
Protein GI258651274 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.768116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGT TCCTGGGCAT CCCGAGAAGA CATGCGGCGC AGGAGGAGTC GCCGTTGCGG 
GGGGCGTTGC CGTTCCCGAA GGCGGCCCGG CGGGAGTTGG CCAACGATCA GTTGCGGCGG
AACCTGGCCC ACGCCACGTC GGTGATCCGG AGCAAACGGG CCCTGGTCGT GGACGAGATG
CCCGACTGGG AAGCGTTGCG TGATGCCGGC GCGGCGACCA AGACGCAGGT GATGTCGAAC
CTGCCCGCGC TGCTGGAACA GTTCGAGGCC AACGTGACCG CCCGCGGCGG CGTCGTGCAC
TGGGCCACCG ACGCCCAGGA GGCCAACCGG ATCGTCACCG AACTGGTCCG CGCGCAGGGC
GTCGACGAGG TCATCAAGGT CAAATCGATG GCCACCCAGG AAATCGGCCT CAACGAATAC
CTCGAGGACA ACGGGATCGC CGCGGTCGAG ACCGACCTGG CCGAACTGAT CGTGCAACTG
GGCCACGACA CGCCCTCGCA CATCCTGGTC CCGGCCATCC ACCGCAACCG CACCGAGGTC
CGCGACATCT TCCTGGCCGA AATGGAAGAC GCCCCGGCCG ATCTGACCGA TGAACCACGC
CGGTTGGCGA TGGCCGCCCG GGAACACCTG CGGCGCAAGT TCCTGGCCAG CAAGGTCGCG
ATCTCGGGCA CCAACTTCGG CATCGCCGAG ACCGGCACCA TCGGCGTGGT CGAGTCCGAG
GGCAACGGCC GGATGTGCGT GACCCTGCCC GAGACCCTGA TCACCGTGAT GGGCATCGAG
AAGATCCTGC CCACCTTCAC CGACCTGGAA GTGTTCCTGC AGCTGCTGCC CCGCTCCTCG
ACCGGGGAAC GGATGAACCC CTACACCTCG CTGTGGACCG GGGTCACCCC GGGCGACGGC
CCGCAGAACT TCCACCTCAT CCTGCTCGAC AACGGCCGCA CCAACGCCCT GGCCGACCAG
GTCGGCCGGG CCGCGCTGCA CTGCATCCGC TGCTCGGCCT GCCTGAACGT GTGCCCGGTC
TACGAACGCA CCGGCGGCCA CGCCTACGGC TCGGTCTACC CCGGCCCGAT CGGCGCCGTG
CTGACCCCGC AACTGACCGG CATGCACGGC CACCACGACG TGAACTCGAC CCTGCCCTAC
GCCTCCTCGC TGTGCGGCGC CTGCTACGAC GTCTGCCCGG TCAAGATCCC CATCCCGGAC
CTGCTCGTCG AACTGCGCGG CCGCGCCGTG GACGCCGACC GCGGCCGCAC CATCCCCGGC
GGCTGGGACG CCGCGATGAA AGCCGCCGCC TGGATCATGA GCGACCCCAC CCGGTTCGCC
GCCGCCGAGA AAGGCCTGGC CGCCGGCCGT CTCGTCGCCG GCCGCGACAA GAAGATCAAA
CACCTGCCGT TCCCCGGATC GGCCTGGACC CACACCAAGG ACATGCCCGC CCCGCCCAAG
CAGACGTTCC GGCAATGGTG GAAGGAAACG CATCATGAGT AG
 
Protein sequence
MSTFLGIPRR HAAQEESPLR GALPFPKAAR RELANDQLRR NLAHATSVIR SKRALVVDEM 
PDWEALRDAG AATKTQVMSN LPALLEQFEA NVTARGGVVH WATDAQEANR IVTELVRAQG
VDEVIKVKSM ATQEIGLNEY LEDNGIAAVE TDLAELIVQL GHDTPSHILV PAIHRNRTEV
RDIFLAEMED APADLTDEPR RLAMAAREHL RRKFLASKVA ISGTNFGIAE TGTIGVVESE
GNGRMCVTLP ETLITVMGIE KILPTFTDLE VFLQLLPRSS TGERMNPYTS LWTGVTPGDG
PQNFHLILLD NGRTNALADQ VGRAALHCIR CSACLNVCPV YERTGGHAYG SVYPGPIGAV
LTPQLTGMHG HHDVNSTLPY ASSLCGACYD VCPVKIPIPD LLVELRGRAV DADRGRTIPG
GWDAAMKAAA WIMSDPTRFA AAEKGLAAGR LVAGRDKKIK HLPFPGSAWT HTKDMPAPPK
QTFRQWWKET HHE