Gene Namu_1801 details

Gene Information

Locus tag: Namu_1801
Symbol:
ID: 8447406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1976258
End bp: 1977379
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 75%
IMG OID: 645040930
Product: UspA domain protein
Protein accession: YP_003201180
Protein GI: 258652024
COG category: [T] Signal transduction mechanisms
COG ID: [COG0589] Universal stress protein UspA and related nucleotide-binding proteins
TIGRFAM ID:
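
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1976258
end_bp = 1977379

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1122 bp
print(protein_length, "aa")  # 373 aa
```

Both results match the Gene Length and Protein Length fields. The calculation does not depend on the strand.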


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00612057
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.112863
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCGC TCGAGGAATC CCGGGCCGTC CACGAGGCGC AGCGTGGGGT GGTCACCCGA 
GCCGGTGGGG TGACCAGCCA TGAGCATGCG CTGCAGCAGT ACCTGGTCGC CGCGCAGTAC
GGCGTGACCC GCGGCGATCG CCCGCAGCCG GACGCCGCCG CCCCGCCCGC GGCACCGCCG
GCCGCCCAGA CCGTCGCCGA CGAGAGTTCG GACCGGACCG ACCGGCGGTC CGAGTACCCG
GCGGGTGCCG TGGTGGTCGG CGTCGACGAC TCGGCCGGCG CCCGGGCGGC GGCTGCGTGG
GGCGCCGACG AGGCGGTGCG GCGACACGCC CCGCTGGTGC TCGTGCACGC CTACCGGTTG
CCGGCGACCG GCGGTTTCCC CGGCTACAAC CCGGTCCCGG ACGATCTGCT CGAACAGCTG
CGGGCCGCGG GCGACCACCT GCTCCGGCGC ATCGGCGAGG AGGTGGCGGG CCGCCACCCG
GACCTGCCGG TCGTCCGCTC GTTGGTCCAC GGCCGCGCCG AGGTGGCGTT GCGGGAGGCC
TCCGGGCAGG CGCGATTGAC GGTCGTCGGC AACGCGCCGT CGTCCCGGGT GGCCGGTGCG
TTGCTCGGTT CGGTGGCCCT GGCCGTGACG TCGTCGAATC CGGTTCCCGT CGCGGTGGTC
CATGCCGGGC ACCAGGTGGC CGACGGACCG ATCGTCGTCG GCGTCGACGG GTCCCCGCTC
AGCGAGGCGG CGGTGGCGTT CGCGTTCGAC GAGGCCGCCC TGCGCGGCGT CGAACTGGTT
GCCGTCCATG CCTGGAACGA TGTCTACCTG GACTCCCGAC GGTTGGAGCC GCTGCTGATC
GATCCGCAGA CCCTGCTGGA GCAGGAGCGG GCTCTGCTCG GTGAACGCCT CGCCGGGTGG
GGGGAGAAGT ACCCGGACGT CCCGGTGCGG CAGGTCCTGC TGCATCAGCG ACCCGTGCAG
GCGTTGCTGG GCTACGCCGA CTCCGCGGGC GCGATGGTCG TGGGCAGCCA CGGCCGCGGC
GGGTTCGCCG GGATGCTGCT CGGGTCCACC GGACACGCGT TGGCCACCCA CGGCCAGTGC
CCGGTGATCG TCGTCCGGAA CGCCGTCGAC CGGGCCCGCT GA
 
Protein sequence
MNPLEESRAV HEAQRGVVTR AGGVTSHEHA LQQYLVAAQY GVTRGDRPQP DAAAPPAAPP 
AAQTVADESS DRTDRRSEYP AGAVVVGVDD SAGARAAAAW GADEAVRRHA PLVLVHAYRL
PATGGFPGYN PVPDDLLEQL RAAGDHLLRR IGEEVAGRHP DLPVVRSLVH GRAEVALREA
SGQARLTVVG NAPSSRVAGA LLGSVALAVT SSNPVPVAVV HAGHQVADGP IVVGVDGSPL
SEAAVAFAFD EAALRGVELV AVHAWNDVYL DSRRLEPLLI DPQTLLEQER ALLGERLAGW
GEKYPDVPVR QVLLHQRPVQ ALLGYADSAG AMVVGSHGRG GFAGMLLGST GHALATHGQC
PVIVVRNAVD RAR
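
As a minimal sketch of how the gene and protein sequences above relate, the snippet below strips the block spacing, translates codons, and computes a GC fraction. It uses the amino-acid assignments shared by NCBI translation tables 1 and 11. Table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and are not part of any database tooling.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGAACCCGC TCGAGGAATC"
print(translate(first_blocks))  # MNPLEE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and the GC content field.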