Gene Namu_3601 details


Gene Information

Locus tag: Namu_3601
Symbol:
ID: 8449220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3953576
End bp: 3954799
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 69%
IMG OID: 645042672
Product: Integrase catalytic region
Protein accession: YP_003202908
Protein GI: 258653752
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
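
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the final codon is a stop codon that contributes no amino acid. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3953576
end_bp = 3954799

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1224 407"
```

Both results agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields.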


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.000351603
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0699896
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGATT GGGCTCTGAT CCGGCGGCTG GTGGCGGACG GTGTTCCGCA GCGCCAGGTC 
GCACGGGACC TGGGCATCGG GCGGGCGACG GTGGCGCGGG CGTTGGCTTC GGACCGGCAA
CCGAAGTACG AGCGGCCAGT GGTGCCGACC TCGTTCACAC CGTTCGAACC GGCGGTGCGT
CAACTGCTGG CCACGACGCC GGACATGCCG GCCACCGTCA TTGCCGAGCG GGTCGGTTGG
ACCGGGTCGA TCACCTGGTT CCGCGACAAC GTGCGGCAGC TGCGGCCTGA ACACCGGCCG
GCCGATCCTT CGGACCGGTT GATCTGGCTG CCCGGCGATG CGGCCCAGTG CGACCTGTGG
TTCCCGCCGA AGAAGATTCG GCTCGAGGAC GGCAGCAAGA CGCTGCTCCC GGTCATGGTG
ATCACCGCAG CCCACTCGCG GTTCATGGTC GCCAAGATGA TCCCCACCCG CCACACCGCC
GACCTCCTGC TGGCGATGTG GCTGTTGCTG CAACTCCTGG GCAGGGTCCC GCGCAGGCTG
ATCTGGGACA ACGAGTCCGG CATCGGCCGC GGCAAGCGCC ACGCTGAAGG TGTGGGCGCG
TTCACCGGCG CCCTGGCCAC CACCCTGATC CGGCTCAAGC CCTACGACCC CGAATCGAAA
GGCGTCGTGG AACGCAGGAA CGGTTACTTC GAGACCTCCT TCATGCCCGG CCGCGACTTC
ACGTCGCCGG CCGACTTCGA CGCCCAGTTC ACCGACTGGC TCACGATCGC CAACGCCCGA
GTGGTGCGCA CCATCAAGGC CCGACCCATC GACCGGCTCG ATGCAGACCG GGCGGCGATG
CTGCCCCTGC CACCAGTGCC GCCAGCGGTG GGTTGGATCA ACCGAGTCCG GCTGGGACGC
GACTACTACG TCCGCGTCGA CAGCAACGAC TACTCCGTCG ACCCGGCAGT GATCGGCCGG
TTCGTCGACG TCACCGCCGA CCTGGCACGA GTCCAGGTCC GCCACGAAGG ACGCCTCGTC
GCAGCCCATG AACGAGTGTG GGCCCGCGGA CAGGTCGTCA CCGACCCCGC CCACGTCGCG
GCCGCGAAGG CGCTGCGCGA GCAGCTCCAA CTGCCCCGAC CAGCACCCGG CCACCACGAC
GAACTTGCCC GGGACCTGGC CGACTACGAC CGCGCCTTCG GGCTCATCAC CGGCGGCCTG
ACCGACGGCG AGGAGGTGGC GTAA
 
Protein sequence
MEDWALIRRL VADGVPQRQV ARDLGIGRAT VARALASDRQ PKYERPVVPT SFTPFEPAVR 
QLLATTPDMP ATVIAERVGW TGSITWFRDN VRQLRPEHRP ADPSDRLIWL PGDAAQCDLW
FPPKKIRLED GSKTLLPVMV ITAAHSRFMV AKMIPTRHTA DLLLAMWLLL QLLGRVPRRL
IWDNESGIGR GKRHAEGVGA FTGALATTLI RLKPYDPESK GVVERRNGYF ETSFMPGRDF
TSPADFDAQF TDWLTIANAR VVRTIKARPI DRLDADRAAM LPLPPVPPAV GWINRVRLGR
DYYVRVDSND YSVDPAVIGR FVDVTADLAR VQVRHEGRLV AAHERVWARG QVVTDPAHVA
AAKALREQLQ LPRPAPGHHD ELARDLADYD RAFGLITGGL TDGEEVA
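
The gene sequence begins with GTG while the protein begins with M, which is the expected behaviour under translation table 11, where several alternative start codons are read as methionine at initiation. The following is a minimal sketch of that translation and of a GC-content calculation, not the method this database uses. The helper names `translate_cds` and `gc_fraction` are hypothetical, and the start-codon set is taken from the NCBI definition of table 11.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino-acid assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = []
    for index, codon in enumerate(codons):
        amino = CODON_TABLE[codon]
        if amino == "*":
            break  # stop codon: translation ends here
        if index == 0 and codon in START_CODONS:
            amino = "M"  # an initiator codon is read as methionine
        residues.append(amino)
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the gene sequence above.
print(translate_cds("GTGGAGGATT GGGCTCTGAT C"))  # prints "MEDWALI"
```

Applying `translate_cds` to the full gene sequence should reproduce the protein sequence shown, and `gc_fraction` can be compared against the reported GC content of 69%.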