Gene Namu_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1103 
Symbol 
ID8446699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1225507 
End bp1226709 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content59% 
IMG OID645040240 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_003200499 
Protein GI258651343 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0442932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGAA TCGAACCTCG CACTCTGCCC AGTCTGCTCA CCTTCATCGT CGATAACCGA 
GGCCGCTCGT GTCCCACCGA GGCGGAGGGC TTTCCTTTGA TCGCGACAAA CTGTGTCAAA
GACGATAGTC TTTATCCGGT GTTTGAGAAC GTACGGTACG TTTCGCAGGC CACCTACCGC
GATTGGTTTC GCAGTCATCC AGAGCCGGGT GACATCGTCT TCGTATGTAA GGGTTCGCCG
GGACGAATCG CGATGGTTCC GGACCCAGTG CCGTTTTGTA TCGCGCAAGA CATGGTGGCG
TTGCGTGCGA ACTCTCGAAT CGTGAATCCG CACTATCTAT ACTATGCCCT CAAGAATCAG
GAGGTGCGCG CTCGAATCGA GAATATGCAT GTCGGGACAA TGATTCCTCA CTTTAAGAAG
GGCGATTTCG GAAAGCTGCA CCTGGACGTT CATGTCCGAT TGTCCGATCA GATGGCGATC
GCGGAGGTGC TGGGCGCGCT CGACGATAAG ATCGCCGGCA ACTCCAAGAT GGCTTCGACT
GCTGGCGAAT TGGCTACGGA GTGCTTCCGT GATGTATCTA TTGATGCAAC GTTCGATGAA
ACGACTTTTG AGAAGGTGGC CGCTATCGGT GGAGGTGGAA CACCTTCCAC GAAGGTGCCC
GGGTATTGGG ACGGGCCCAT CGCCTGGGCA ACGCCGACTG ACCTGACGGC ACTGCCTGGC
CCGTACTTGG AGCGCACGGC TCGATCGATC ACCCTGTCTG GTCTGGACAA CTGCGCGTCG
GCACTCTTCC CGCGGGGCGC CATTCTGATG ACCTCGCGCG CTACGATCGG AGCCTTTGCG
ATTGCGCAAC GGCCGGTGGC GGTCAATCAA GGATTCATTG TCGTCGTTCC CGAAGATCCC
CAGATGAAAT GGTGGCTTTT CCACACGATG CGTGACCGGG TCGACGAGTT CATCTCGCAT
GCTAATGGGG CGACCTTCCT GGAGCTGAGT CGGGGTAGGT TCCGAAGCCT TCCGGTTCGG
GTCCCCGCCG GGCGTGTCCT GCGGGCTTTC GATGAGCGAG TGGAGGCAAT CCATGCGGTA
GCGCGACACG CACTGGTCGA AAATACGGAG TTGGCCGAAC TTCGCGACAC TCTCCTCCCG
CACCTCATGT CCGGCAGGCT CCGCGTCAAG GACGCCGAAA AGCAGGTGGA GGCGGTGGTG
TAG
 
Protein sequence
MNGIEPRTLP SLLTFIVDNR GRSCPTEAEG FPLIATNCVK DDSLYPVFEN VRYVSQATYR 
DWFRSHPEPG DIVFVCKGSP GRIAMVPDPV PFCIAQDMVA LRANSRIVNP HYLYYALKNQ
EVRARIENMH VGTMIPHFKK GDFGKLHLDV HVRLSDQMAI AEVLGALDDK IAGNSKMAST
AGELATECFR DVSIDATFDE TTFEKVAAIG GGGTPSTKVP GYWDGPIAWA TPTDLTALPG
PYLERTARSI TLSGLDNCAS ALFPRGAILM TSRATIGAFA IAQRPVAVNQ GFIVVVPEDP
QMKWWLFHTM RDRVDEFISH ANGATFLELS RGRFRSLPVR VPAGRVLRAF DERVEAIHAV
ARHALVENTE LAELRDTLLP HLMSGRLRVK DAEKQVEAVV