Gene Namu_3121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3121 
Symbol 
ID8448735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3441774 
End bp3443045 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content77% 
IMG OID645042202 
ProductDNA protecting protein DprA 
Protein accessionYP_003202443 
Protein GI258653287 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0011897 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.148234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCC TGTCCGCGGA ATCGAGCGGG GAGCTCGAGC CGGAACGGGT GGCCCTGGCC 
GGGTTGCTGC GGGCCTGCGA GCCACCCAGC GCGGCCCTGG TCCGCTACGT CCAGGGGCAC
GGGCCGGAGG CGGCCTGGCA GGCGGTGCTG GCCCGGCGGG CGCCCCGGGC GGTGCTGTCG
GCGACCGCCG CCCGGGTGGA CGGGGTGGAC CCGGCCGTGC TGGTGGCCCG GGCCGAGGCG
GACCTGGTCG CGGCCGCTCG GGTCGGGGCG CGGCTGATCG GGCCGCCGGA CGACCAGTGG
CCGGGCGCCG CGGTGTCGGC GTTCGCCGGG GCGATGGCCC GAGGGGTGCG CGGGGCCGGC
CCGCCGCTGG CCCTGTACGT GCGAGGCCGG CCGCTGACCG GGTTGCCCGA CCGGGCCGTG
ACGGTGGTCG GGTCGCGGGC GTCGAGCCCG TACGGGACGC GGGTGGCCGG CGAGATGGCC
TATGAGCTGG CCCGGGCCGG GTCGGTGGTG GTCTCCGGCG CCGCGTTCGG GATCGACACG
GCGGCCCACC GCTCGGCGCT GCAGGTGGTG CGGTCGGCCC CGGACGAGCC GCGGTCGGTC
ACCGTTGCCG TGCTGGCCTG CGGCATCGAC CGGGCCTATC CGGAGGCCAA CCGCGACCTG
CTCGACCTGA TCGCCCGGCA CGGCTCGGTG GTCAGTGAAT ACCCGCCGGG CACTGTGCCG
GCCCGGCACC GGTTCCTGGT GCGCAACCGG CTGATCGCGG CGTTCGGAGC GGCCACCGTG
GTGGTCGAAG CCGGGCGGCG CTCCGGGACC TTGTCCACCG CGGCCGCCGC CGAACAGTTG
GGACGGATGG TGATGGCCGT GCCGGGGCCG GTGACCTCGG CCATGTCGGT GGGCTGCCAC
CTACTGCTGG CCGACCGGTT CGCGCAGTTG GTCACCGGGG CCGACGACGT GCTGACCGCG
CTGGGCCGGA CGGGCCGGAC CGGGCGGCTG GAGAGCAGAG CCGTGCCCGG ACCGCACGAC
GGGGCGGTGC CGGGGGAGGA CCCACGGCAT CCCACCGACG GCCTGGAGCC GACCAACGCC
CGGGTCTACG ACGCGTTCCC CACCCGCGGG TCGACGTCGG TACACGAACT GGTCGTCGAG
TCCGGGCTGC CGGCGGCCAC CGTGATGGGG GCGCTGGCCG TGCTGCAACT GCACGGTCTG
GCCGACCAGG ACGGACCGCA GTGGCGCCGG GTGCGACCCG ACCGCGGGAC ACCGCCGCGG
CAGATAGGCT GA
 
Protein sequence
MSILSAESSG ELEPERVALA GLLRACEPPS AALVRYVQGH GPEAAWQAVL ARRAPRAVLS 
ATAARVDGVD PAVLVARAEA DLVAAARVGA RLIGPPDDQW PGAAVSAFAG AMARGVRGAG
PPLALYVRGR PLTGLPDRAV TVVGSRASSP YGTRVAGEMA YELARAGSVV VSGAAFGIDT
AAHRSALQVV RSAPDEPRSV TVAVLACGID RAYPEANRDL LDLIARHGSV VSEYPPGTVP
ARHRFLVRNR LIAAFGAATV VVEAGRRSGT LSTAAAAEQL GRMVMAVPGP VTSAMSVGCH
LLLADRFAQL VTGADDVLTA LGRTGRTGRL ESRAVPGPHD GAVPGEDPRH PTDGLEPTNA
RVYDAFPTRG STSVHELVVE SGLPAATVMG ALAVLQLHGL ADQDGPQWRR VRPDRGTPPR
QIG