Gene Namu_1924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1924 
Symbol 
ID8447531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2118900 
End bp2121887 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content77% 
IMG OID645041054 
ProductDNA repair exonuclease, SbcC 
Protein accessionYP_003201302 
Protein GI258652146 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0557877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00069537 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGGTTGC ACGAGCTGGA GCTGTCGGCG TTCGGACCGT TCGCCGGCAC CGAGACCGTC 
GACCTGGACG CGGTCAGTGC GGACGGCCTG TTCCTGATCC ACGGCGACAC CGGGGCGGGC
AAGACCTCCC TGCTCGATGC GGTTGCCTAC GCCCTGTTCG GCCGGGTCCC CGGTCCCCGC
AACGAGGCTC GCCGGCTGCG CTGCGACCGC GCGCCGGCGG ACGTGGTGAC CCAGGTCCGG
CTCGTGGCCA CCCTCGGCGG TCATCGGGTC GAGATCATTC GCCGGCCCGA GTACCTGCGT
CCCAAGGCCC GGGGCAGCGG CAGCACCTTG CAGCGCGGCA AGGTGTCGCT GCGCTGGCTC
GACCGCACTC CCGCCGGGGC CCGGCCCGAG GGGCTGACCC GGGTCGATGA GGTGGGCGAC
GCGGTGATCG ACCTGCTCGG CATGTCGGCC GATCAGTTCT TCCAGGTGGT GCTGTTGCCG
CAGGGCGAGT TCGCCCGTTT CCTGCGGGCC GACACGGCCG AGCGGGGCGA TCTGCTGGAG
CGGCTGTTCG ACACCCAGCG CTTCGGCCGG ATCGAGGACT GGTTCGCCCA GACCCGCCGG
GTGGCCGGGC AGCGCCTGCG CGAGTGCGAC GACCACATCC GGGAAATAGC CGCGCGGGTG
GCCGAGGCGG CCCGGGTCGA GGCCGCGCCC GAACCAGACG ACCGGTGGTT GGCCGGCCTG
CGCGACCGGT TGGCCGACCG CGCCGAGCTC GCCCGGGAGG CTGCGCAGGA CGCCGCCCGG
CAGCGCGAGG TGTCGGCGGC GGTCCTGCGG GCCGCGACCC AGCGAGCCGC CCGGACTGCG
CGGTTGGCCG AGTTGCGCCG CCGGTTGGTC GATCTCGAAC GGCGGGCCCC CGAGATCGAG
CAGGATCGTC GTCGTCTTGA TGCGCACACC CGGGCCGGGC CGGTGGTCGC CGCGGCCCGG
GCGCAGCAGG CGGCCGGCGA GGTGCGGGCG GACCGCCGGC AGCAGCGGGT GGCGGCCGGC
CGTCGCCTGC ACGCGCTGGC GGAGGCCGCC GCCGACCCCG ACCGGTTGGA CCTGGGTCTG
CTGGCCGACG ACCCGGTCGC CATCCGGGCC GCCGCCGGGG TGGACCGGGA TCGCGCCGGC
GCGCTGATCC CGCTGGTCGG TGAGGCGCAG GAGCAGCAGC GGGATCAGGC GGCGCTGGCC
GCGGCCCGGC GGCGGCACCA CCGCGACGAG AACGCCGCGG TCGACGTCGA GCAGCGGCTC
GCGGCCTTGC CGCCGCGGCT GGCCGAGCTG GATCGGCGGG TCGACCTGGC CCGGTCGGCC
CGCGACCGAT TGCCGGCGGC GCAGGCTGAG CTGGCAGCGG CCGAGCAGAT CCGGGAGGCC
GCGGTCGCGG TGCCGGAGCT GGCGGAGCGG CGACGGCGGG CCGAGGCCGC GGCGGTCGCG
GCCACCGACC GGCATCAGGT CGCGGTGGAC GAGCGGCAGG CCCTGGTGCA GCGGCGGATC
GACGGCATGG CGGCCGAACT CGCCGGCACG CTCAGCGCCG GCGACGGCTG CCCGGTGTGC
GGATCGGTCG AGCACCCGAG CCCGGCCCGG CCCCTGGCCG CGCCGGTCGA CGCGGGCCTC
ATCGAGGCCG CCCAGGTGCG GGAGTGGCGG GCCGCGGCCG ACCGGGACTC CGCCACGGGT
CGCCGGTGGG AGGCGCAGAA CGAGCTGGCC GTGGCGCAGG AGCGCGCCTG CGGCCGGCTG
GCCGAGGCCG CCGAGGCGCA GGTGCGCACG CACCGCGCGA CGGTCGACCG GCAGACGCGG
ATCGCCGCCA ACCTCGATGC GCTGGTCGGG CTGCGGGAGG CGGCGGCCAA GTCCCTGTTC
GAGCAGGAAC TCCGGCGTGA CGAGCTGGGC ACCCTGGTGG CCCTCGGCGC GGCCGAGATC
GCCTCCCTGG CGGCCCGGGT GGATCAGCGA GCGGCCCGGC TGCGGTCGGC CCGGGGCGCC
CACCCGTCCA TCCGGCAGCG CCGGGAGTAC CTGCTGGCCC GGGCAGCGGC CCTCGACGCG
GTGGCGCAGG CCTGCGCCGC CGTCACCGAC GCCGACGCCC AGGACGAGCG GGCCCGGGCC
ACGGTCACCC GGTTGCTGGC CGACTTCGGG TTCGGCGACC TGGCCGAGGT GACCGCGGCC
GCCGACCTCG ATGCGCCGCG CCTGGCCGAG CGGATCCGCA CCGCCGAGGT CGACGCGGCC
GCGCTGCGGG CGCAGTTGGC CGACCCGGAA CTGGCCGGCC TGGACGAGTC GGACCGGGTC
GACGTCGCGG CGGCGGAGGC GGCGGCCGCC GCCGATGCCC GGCGGGCGCA GGCGGCCCAG
CAGCAGGCCC TGGTCCTGGA CGACCGGTTC CGGCAGGTGG CGACGGCCGC GCACCGGCTG
GTGGCGGCGT GGAGGGCGGC CGCACCGGTC CGCGCGCAGG AGCGGCAAGT CGCCGTGCTG
ACCGAGGTGT TGCTGGGCCG GGGGGAGAAC GCGCTGGGCA TGACGCTGCG CACCTACGTG
CTGGCGCACC GGCTGGCGCA GGTGGCGCAG GCGGCCACCG ATCGGCTGGC CCGGATGTCG
GCCGGGCGGT ACTCGTTCGT GCACCGCACC GATCGCGAAT CGCGGGGCCG CGCCGGCGGT
CTCGGGCTGG AAATCATGGA CGGCTGGTCC GGGCTGGTCC GCCCGGCCAA GACGCTCTCC
GGCGGCGAGT CGTTCCTGGC CTCGCTGGCC CTGGCCCTGG GTCTGGCCGA CGTGGTGGCC
GCCGAGGCCG GTGGTCGCCA GCTCGACACC CTGTTCATCG ACGAGGGTTT CGGCAGCCTC
GATCCGGACG CGCTGGATCT GGTGATGGCC ACCATGGACG AGTTGCGGGC CGGCGGGCGG
GTGGTCGGCG TGGTGTCGCA CCTGGACGAG CTGCGGCTGC GGATCCCGCG GCAGATCCGG
GTCGACCGCA CCCCCCAACG CTCGACCCTG GCGGTGGTGG GCCAGTGA
 
Protein sequence
MRLHELELSA FGPFAGTETV DLDAVSADGL FLIHGDTGAG KTSLLDAVAY ALFGRVPGPR 
NEARRLRCDR APADVVTQVR LVATLGGHRV EIIRRPEYLR PKARGSGSTL QRGKVSLRWL
DRTPAGARPE GLTRVDEVGD AVIDLLGMSA DQFFQVVLLP QGEFARFLRA DTAERGDLLE
RLFDTQRFGR IEDWFAQTRR VAGQRLRECD DHIREIAARV AEAARVEAAP EPDDRWLAGL
RDRLADRAEL AREAAQDAAR QREVSAAVLR AATQRAARTA RLAELRRRLV DLERRAPEIE
QDRRRLDAHT RAGPVVAAAR AQQAAGEVRA DRRQQRVAAG RRLHALAEAA ADPDRLDLGL
LADDPVAIRA AAGVDRDRAG ALIPLVGEAQ EQQRDQAALA AARRRHHRDE NAAVDVEQRL
AALPPRLAEL DRRVDLARSA RDRLPAAQAE LAAAEQIREA AVAVPELAER RRRAEAAAVA
ATDRHQVAVD ERQALVQRRI DGMAAELAGT LSAGDGCPVC GSVEHPSPAR PLAAPVDAGL
IEAAQVREWR AAADRDSATG RRWEAQNELA VAQERACGRL AEAAEAQVRT HRATVDRQTR
IAANLDALVG LREAAAKSLF EQELRRDELG TLVALGAAEI ASLAARVDQR AARLRSARGA
HPSIRQRREY LLARAAALDA VAQACAAVTD ADAQDERARA TVTRLLADFG FGDLAEVTAA
ADLDAPRLAE RIRTAEVDAA ALRAQLADPE LAGLDESDRV DVAAAEAAAA ADARRAQAAQ
QQALVLDDRF RQVATAAHRL VAAWRAAAPV RAQERQVAVL TEVLLGRGEN ALGMTLRTYV
LAHRLAQVAQ AATDRLARMS AGRYSFVHRT DRESRGRAGG LGLEIMDGWS GLVRPAKTLS
GGESFLASLA LALGLADVVA AEAGGRQLDT LFIDEGFGSL DPDALDLVMA TMDELRAGGR
VVGVVSHLDE LRLRIPRQIR VDRTPQRSTL AVVGQ