Gene Namu_1886 details

Gene Information

Locus tag: Namu_1886
Symbol:
ID: 8447493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2071827
End bp: 2073083
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 68%
IMG OID: 645041016
Product: Integrase catalytic region
Protein accession: YP_003201264
Protein GI: 258652108
COG category:
COG ID:
TIGRFAM ID:
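
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2071827, 2073083)
print(length, protein_length_aa(length))  # prints "1257 418"
```

Under these assumptions the result matches the reported gene length of 1257 bp and protein length of 418 aa.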


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0003225
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0562744
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTCTC GGGCCGAGAT CACCACCAGG TACGCCAAGG CGTTCAAGGC TGCGGACAGG 
CGGACGAAGG GCCGGATCCT GGACGAGGTG GTGTCCGTGA CGGGCTGGTC GCGGGACAAC
GCCCGGCGAC GCTTGACCAG CGCCGCGCAG TGCCCGCCGG GCGGCGGCCG ACAGGTCGCC
CAGCGGCCCA GGAAGCAGCG GGCGAACAAG TTCTCCTACG AGGCCGTAAA GGTCCTCCAG
CGGGTCTGGG CGGCTTCCGG TGGGCAGTGC GGCAAGTACC TGGCCGCATC GATGGACACG
CAACTGGACG GGCTGGAACG GCACGGGGAG CTGGTCGACG GCGAGTGCCG GTACAGCGCT
TCGGTGCGGG CCGAGCTGCT CGCGATGTCG CCGGCGACGA TCGACCGCTA CCTGCGGACC
GCGAAGGCCA CCGACCAGGT CCGCGGTGTC TCGACCACGA AACCGTCACC GTTGCTGCGG
TCCTCGATCA AGATCCGTAA GGCGGGCGAC GAGGTCGAGG CCGAGCCCGG CTTCTTCGAG
GGCGACACGG TTGCACATTG CGGCCCGACC CTGCGGGGCG AGTTCGCTCG CTCGGTGAAC
CTGACCTGTG TGCATACCGG GTGGGTCTTC ACCCGCTCGA CGCGCAACAA CGCGCACGCC
AACATCCTGG CCGCGCTGCA GGCCGGGGTG CAGGAGATCC CTTTCGCGGT CACCGGTCTG
GACTTCGACA ACGGAGGTGA GTTCCTGAAC CGGGCCGTCA TCAAATGGGC CGCCGAGCGA
GACATCTACT TCACCCGGTC CCGGCCGTAC AAGAAGAACG ACCAGGCCAC GATCGAGTCG
AAGAACAACC ACCTGGTCCG CCGGTACGCG TTCTACTACC GGTACGACAC CGACGAGGAG
CGGCACGCGT TGAACCGGCT CTGGAAGCTG GTCAACGACC GGCTCAACTA CCTCACCCCG
ACGATCAAGC CGGTCGGCTG GGGTGAGAAC AAGGCCGGTC GCCGCAAACG CCTGTACGAC
AAGCCGCAAA CCCCGTTGAG TCGGCTGCTG GCCGCCGGCA CGCTGTCGCC GGCGCAAGCC
CACGAGCTGA CCGCCTACCG GGACGGGCTC AACCCAGCCG CGCTCGCCCG TGAGATCGCC
GACATTCAAG CCGTGCTGCT GGGCCTGGCC AAGAACAAGA CCGAGCAGCT CTACCTCGCG
ACCGTCCCCA AGGCACTGCC CGACGTGCGC AAAGGCGTCC GGATCCGGGC CGGCTGA
 
Protein sequence
MASRAEITTR YAKAFKAADR RTKGRILDEV VSVTGWSRDN ARRRLTSAAQ CPPGGGRQVA 
QRPRKQRANK FSYEAVKVLQ RVWAASGGQC GKYLAASMDT QLDGLERHGE LVDGECRYSA
SVRAELLAMS PATIDRYLRT AKATDQVRGV STTKPSPLLR SSIKIRKAGD EVEAEPGFFE
GDTVAHCGPT LRGEFARSVN LTCVHTGWVF TRSTRNNAHA NILAALQAGV QEIPFAVTGL
DFDNGGEFLN RAVIKWAAER DIYFTRSRPY KKNDQATIES KNNHLVRRYA FYYRYDTDEE
RHALNRLWKL VNDRLNYLTP TIKPVGWGEN KAGRRKRLYD KPQTPLSRLL AAGTLSPAQA
HELTAYRDGL NPAALAREIA DIQAVLLGLA KNKTEQLYLA TVPKALPDVR KGVRIRAG
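
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute GC content, and translate the coding sequence. It is an illustration, not the tool that produced this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11 differs only in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their block spacing.
print(translate("ATGGCGTCTC GGGCCGAGAT CACCACCAGG"))  # prints "MASRAEITTR"
```

Applied to the first thirty bases, the translation reproduces the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare with the reported GC content.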