Gene Namu_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3701 
Symbol 
ID8449320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4064111 
End bp4065334 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content69% 
IMG OID645042762 
ProductIntegrase catalytic region 
Protein accessionYP_003202998 
Protein GI258653842 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.25457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.162574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGGATT GGGCTCTGAT CCGGCGGCTG GTGGCGGACG GTGTTCCGCA GCGCCAGGTC 
GCACGGGACC TGGGCATCGG GCGGGCGACG GTGGCGCGGG CGTTGGCTTC GGACCGGCAA
CCGAAGTACG AGCGGCCAGT GGTGCCGACC TCGTTCACAC CGTTCGAACC GGCGGTGCGT
CAACTGCTGG CCACGACGCC GGACATGCCG GCCACCGTCA TTGCCGAGCG GGTCGGTTGG
ACCGGGTCGA TCACCTGGTT CCGCGACAAC GTGCGGCAGC TGCGGCCTGA ACACCGGCCG
GCCGATCCTT CGGACCGGTT GATCTGGCTG CCCGGCGATG CGGCCCAGTG CGACCTGTGG
TTCCCGCCGA AGAAGATTCG GCTCGAGGAC GGCAGCAAGA CGCTGCTCCC GGTCATGGTG
ATCACCGCAG CCCACTCGCG GTTCATGGTC GCCAAGATGA TCCCCACCCG CCACACCGCC
GACCTCCTGC TGGCGATGTG GCTGTTGCTG CAACTCCTGG GCAGGGTCCC GCGCAGGCTG
ATCTGGGACA ACGAGTCCGG CATCGGCCGC GGCAAGCGCC ACGCTGAAGG TGTGGGCGCG
TTCACCGGCG CCCTGGCCAC CACCCTGATC CGGCTCAAGC CCTACGACCC CGAATCGAAA
GGCGTCGTGG AACGCAGGAA CGGTTACTTC GAGACCTCCT TCATGCCCGG CCGCGACTTC
ACGTCGCCGG CCGACTTCGA CGCCCAGTTC ACCGACTGGC TCACGATCGC CAACGCCCGA
GTGGTGCGCA CCATCAAGGC CCGACCCATC GACCGGCTCG ATGCAGACCG GGCGGCGATG
CTGCCCCTGC CACCAGTGCC GCCAGCGGTG GGTTGGATCA ACCGAGTCCG GCTGGGACGC
GACTACTACG TCCGCGTCGA CAGCAACGAC TACTCCGTCG ACCCGGCAGT GATCGGCCGG
TTCGTCGACG TCACCGCCGA CCTGGCACGA GTCCAGGTCC GCCACGAAGG ACGCCTCGTC
GCAGCCCATG AACGAGTGTG GGCCCGCGGA CAGGTCGTCA CCGACCCCGC CCACGTCGCG
GCCGCGAAGG CGCTGCGCGA GCAGCTCCAA CTGCCCCGAC CAGCACCCGG CCACCACGAC
GAACTTGCCC GGGACCTGGC CGACTACGAC CGCGCCTTCG GGCTCATCAC CGGCGGCCTG
ACCGACGGCG AGGAGGTGGC GTAA
 
Protein sequence
MEDWALIRRL VADGVPQRQV ARDLGIGRAT VARALASDRQ PKYERPVVPT SFTPFEPAVR 
QLLATTPDMP ATVIAERVGW TGSITWFRDN VRQLRPEHRP ADPSDRLIWL PGDAAQCDLW
FPPKKIRLED GSKTLLPVMV ITAAHSRFMV AKMIPTRHTA DLLLAMWLLL QLLGRVPRRL
IWDNESGIGR GKRHAEGVGA FTGALATTLI RLKPYDPESK GVVERRNGYF ETSFMPGRDF
TSPADFDAQF TDWLTIANAR VVRTIKARPI DRLDADRAAM LPLPPVPPAV GWINRVRLGR
DYYVRVDSND YSVDPAVIGR FVDVTADLAR VQVRHEGRLV AAHERVWARG QVVTDPAHVA
AAKALREQLQ LPRPAPGHHD ELARDLADYD RAFGLITGGL TDGEEVA