Gene Namu_4385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4385 
Symbol 
ID8450011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4868232 
End bp4869428 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content73% 
IMG OID645043432 
Productprotein of unknown function DUF195 
Protein accessionYP_003203661 
Protein GI258654505 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACAT CCGGCTTGCT CCTGGGCGCG CTGCTGCTGG CGGTCGGTGC CGCGATCGGC 
TTCGTCGTCG GGCTGAGCGT GGGTCGAGCC CACCCACGGG ATCAGCGCTC GGCGGCGGAG
CTGGACGCGT TGCTGGCGCC GGCCTCCGAC GCGATGCAGC GGGTGGAGCA GCACCTGCAC
GAGGTGGAAC GGGATCGCGC CGCCGCCTAC GCGGGCCTGC GCGAGCAGGT GTCGGCCCTG
CATCTGACGT CCGCCGATCT CAACCGGCAC ACCCGGGCCC TGGCCGGGGC GCTCAGCGCG
CCGCAGGTGC GGGGGCGATG GGGAGAGATG CAGTTGCAGC GGGTGGTCGA GCTGGCCGGG
ATGGTCGAGC ACTGCGACTT CGACACCCAG GTCGGCGTGC GCTCGGACGA TCCGGAGGCG
GCCGGCGTCC GCCCGGACAT GATCATCCGG TTGGCCGGGG GACGCCAGAT CCCGGTCGAT
GCCAAGGTCC CCTTCGCCTC CTACCTGGAG GCGGCCGAAT GCACCGACGA GCGGCGGCGG
TCCGCACTGC TGGCCGCGCA CTCCCGCGCG CTGCGCAGCC ATGTGGACGC GTTGGCGGCC
AAGGCCTACT GGCGGCACTT CCAGCCGGCC CCCGAGTTCG TCGTGCTGTT CGTACCGGGT
GAGCCGTTGC TGGACGCCGC CTTGGCCGTC GACCCGGGTC TGGCCGACTA CGCCTTCGCC
CGCAACGTCG TCATGGCCAC CCCGACGTCG CTGATCGGCC TGCTGCGCAC GGTCGCGCAC
GTCTGGCGGC AGGAGCGGTT GTCGGCCTCG GCAGCCCAGG TGCACGAGCT GGGTCGGGAT
CTGCACCGCC GCCTGGCCAC CCTGGCCGGG CACCTGACCA CCCTGGGCGC CAGCCTGGAT
AAGGCGGTGC GCGCCTACAA CGGCACGGTG CGCTCGCTGG AGTCGCGGGT CCTGGTCTCG
GCCCGCAAGC TGGCCGATCT GGGGGTCACC GGGGAGGAGC TACCCAGTCC GGCCCAGGTC
GAGACCACCA CGTTGGTGCC CCAGGAGGCG GCCTACGCTC GCGCCGCGGA CCCGCTGGAC
GCGGCCGCCG AGTTCCAGGC GCAGGCCGCC GTGCAGCGGG AGGCGGCCGA GGCGGCGTAT
GCCGCGCGTG TGCACACATC CCCTTTACCG ATCCGGGACG CTACCGTTGA CCGGTGA
 
Protein sequence
MDTSGLLLGA LLLAVGAAIG FVVGLSVGRA HPRDQRSAAE LDALLAPASD AMQRVEQHLH 
EVERDRAAAY AGLREQVSAL HLTSADLNRH TRALAGALSA PQVRGRWGEM QLQRVVELAG
MVEHCDFDTQ VGVRSDDPEA AGVRPDMIIR LAGGRQIPVD AKVPFASYLE AAECTDERRR
SALLAAHSRA LRSHVDALAA KAYWRHFQPA PEFVVLFVPG EPLLDAALAV DPGLADYAFA
RNVVMATPTS LIGLLRTVAH VWRQERLSAS AAQVHELGRD LHRRLATLAG HLTTLGASLD
KAVRAYNGTV RSLESRVLVS ARKLADLGVT GEELPSPAQV ETTTLVPQEA AYARAADPLD
AAAEFQAQAA VQREAAEAAY AARVHTSPLP IRDATVDR