Gene Namu_3156 details


Gene Information

Locus tag: Namu_3156
Symbol:
ID: 8448770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3477739
End bp: 3479271
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 78%
IMG OID: 645042237
Product: UMUC domain protein DNA-repair protein
Protein accession: YP_003202478
Protein GI: 258653322
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
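
The length fields above agree with the coordinates if the start and end positions are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic, using hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3477739, 3479271)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```

Both results match the Gene Length and Protein Length fields of the record.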


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0000974062
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000679979
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCGGA CCATCGTCGT CTGGTGCCCG GACTGGCCAG CGGTGGCGGC CGCCCGGCAG 
GCGAACCGGC CGGCCAGCGA CCCGGTGGCG GTGCTGCACG CCAACCGGGT GCGGGCCTGC
ACCGCCGCCG CCCGGGCGCA GGGGGTGCAC GTCGGCCAGC GGCGCCGCGA CGCCCAGTCC
CGCTGCCCGG ACCTGCTGAT CACCGGCGTC GACCAGGACC GGGACGCCCG GATGTTCGAG
CCGGTGGCCG CGGCGGTGGA GTCCCTGGCC CCCGGGGTGG AGGTGCTGCG CTGCGGAGTG
GTGGCCTGTC CGGCCCGCGG ACCGGCCCGC TACTTCGGCT CGGAGGCCGC CGCCGCGGAA
CGGATCGTGG ACGCGGTCGA GGCGCTGGAC GTCGAGTGCT GCATCGGGAT CGCCGACGAC
CTGGAGATCG CGGTGCTGGC CGCCCACCGG TCGGTGCTGG TCCCGCCGGG GGAGTCGGCC
GCCTTCTGCG CCGGGCTGCC GATCACCGAC CTGTCCCGGG ACCCGGCGAT CGGCCCGCCC
GACCGGGTCG CGCTCACCGA CCTGCTGATC CGCCTGGGCA TCACCACCGC CGGGGCATTC
GCGGCGCTCC CACCGGAAAA GGTGGCCACC CGGTTCGGCG CCGACGGGGT GTGCGCGCAC
CGGTTGGCCC TGGGCCGGCC CGAGCGCGGC CTGTCCCGCC GGCAGATCCC CGAGGATCTG
GTCGTCGAGC AGGAGTGCGA CCCGCCGCTG GACCGGGTGG ACACCGCCGC CTTCGCCGCC
CGGGCCCTGG CCGAGCGGTT CCACGCGCGG CTCGCCGACG CCGGCCTGGC CTGCACCCGG
CTGGTCATCA CCGCCGCCAC CGACCGGGGC GCCACTCTGT CCCGCACCTG GCGCTGCGCC
GCGCCGCTGA CCGCCGCGGC CACCGCGGAC CGGCTGCGCT GGCAGCTGGA CGGCTGGCTC
ACCCACCGTC AGCAGCCCGG CGCGATCACC CGGCTCGCCC TGGAACCGGT CGAGGCGGTC
GGCTCCGGGC ACATCCAGTA CGGGTTGTGG GGCTCCGACG GGCAGGACGA CCAGCGGGCC
GGCTGGGCCT TCGCCCGGGT ACAGGGCCTG CTGGGGCCCG ATTCGGTGCT GTCCCCGGTG
CCGGCCGGCG GCCGGAGCAC CGCGGACCGG GTGGTGCTGG TGCCCTGGGG GGACGAGAAG
GTGAGCCCCC GGGACCCGGC CGCGCCCTGG CCCGGGGCGA TCCCGTCCCC CTCGCCGGCC
CGGGTGAGCG ATACCGAGCC GATCGCCGTG CTGGACGCCG CCGGTGACCC GGTGCGGCTC
ACCGACCGCG GCCGGCTGAC CGGCCCGCCG GCCTGGCTCA GCGGCACCCG CATCGACGCC
TGGGCCGGGC CCTGGCTGCT GGACGAGCAC TGGTGGGCCT CGGGCCGGGA CATCGTGCCC
ACCGCCCGGT TGCAGCTGGT CACCGCCGCC GGCGCGGCGC TGCTGGTGCG TTCGGCCGGG
GACGGCTGGC AGGTCGAGGG GACGTACGAC TGA
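
The sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. The function names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed value describes that line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste all lines for the full gene.
first_line = "ATGAGCCGGA CCATCGTCGT CTGGTGCCCG GACTGGCCAG CGGTGGCGGC CGCCCGGCAG"
print(f"{gc_fraction(first_line):.0%}")
```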
 
Protein sequence
MSRTIVVWCP DWPAVAAARQ ANRPASDPVA VLHANRVRAC TAAARAQGVH VGQRRRDAQS 
RCPDLLITGV DQDRDARMFE PVAAAVESLA PGVEVLRCGV VACPARGPAR YFGSEAAAAE
RIVDAVEALD VECCIGIADD LEIAVLAAHR SVLVPPGESA AFCAGLPITD LSRDPAIGPP
DRVALTDLLI RLGITTAGAF AALPPEKVAT RFGADGVCAH RLALGRPERG LSRRQIPEDL
VVEQECDPPL DRVDTAAFAA RALAERFHAR LADAGLACTR LVITAATDRG ATLSRTWRCA
APLTAAATAD RLRWQLDGWL THRQQPGAIT RLALEPVEAV GSGHIQYGLW GSDGQDDQRA
GWAFARVQGL LGPDSVLSPV PAGGRSTADR VVLVPWGDEK VSPRDPAAPW PGAIPSPSPA
RVSDTEPIAV LDAAGDPVRL TDRGRLTGPP AWLSGTRIDA WAGPWLLDEH WWASGRDIVP
TARLQLVTAA GAALLVRSAG DGWQVEGTYD
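
The protein sequence can be checked against the gene sequence by translating codons. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the start codons it permits. This gene begins with ATG, so a plain codon lookup is enough. The sketch below is not the database's own method; the names are hypothetical, and it translates only the first 30 bases and the final 33 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


first_codons = "ATGAGCCGGA CCATCGTCGT CTGGTGCCCG"      # start of the gene
last_codons = "GACGGCTGGC AGGTCGAGGG GACGTACGAC TGA"   # end of the gene

print(translate(first_codons))  # MSRTIVVWCP
print(translate(last_codons))   # DGWQVEGTYD*
```

The two outputs match the first and last ten residues of the protein sequence, with TGA as the stop codon.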