Gene Namu_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2082 
Symbol 
ID8447692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2295644 
End bp2296834 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content61% 
IMG OID645041204 
Producthypothetical protein 
Protein accessionYP_003201449 
Protein GI258652293 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00946987 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00350075 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGACAG AGACTCACCA CCTGGATATC AAGCGCGAGA TCGGCGTCAA ACCGGGCGAC 
CGGGCCGAGC TGGCGCGCGA CCTCGCTCAG TTCGCCATCG ATGGTGGCGC TGTCATCGTC
GGCGTCGAAG AGGACAAGGC CACAAGAACT TGGACGCTGA CACCACAGAA GTTGGTGGGC
CTGCAGGAGC GCATCGAGCA GATCGCGAAC AGCCAGATCG ATCCTCCGCT GTCGATCGCG
ATCCGCGTAC TGGCGCTTGC GGAAGACCCA ACTCAGGGCT ACGTCGTGAT CCAGGTCCCA
GTCTCGCCGC GGGCACCCCA CATGGTCGGT GGCATCTACT ACGGCCGTGG TGAGACCCGT
CGCAATAGGC TATCCGACGC GGAGGTAACG CGGTACCACG CCGCACGTCG CGACACGACG
AGCCGGATCG AGGACCTGCT GCAGGCCGAG ATCGACCGGG ATCCGATCAT CGCCGCTGGG
CAGGCACGGC GCGGTCACTT GTATCTGGTC GCGCAGCCCG AAATCGACCA CGACGAACTT
GCTCTTACAC TGCTTGAAAG CGCGCAAGGG CCGATGGCGG AACCGTACCG CACGATCACA
GCTGGAGCCG AGCAGTTCGT CCACCAGAGT GTTCGAATCT ATGAACCGTC ACCTGGCTAT
GCGTCATCTA TCGCGATACG GTCGACGGGG CGTGCGTACT GCAGTCAAAC GCTCTCGGAT
GGCCGACGAT ACCGACCCGA CGGTGATTTC GAGAGTGATA CTGTCGATAT CGAAGTGCAC
GAAGACGGTG GGATACGGGC CATGGTCGGC CGCATGACCG AGGAATATGC TGGTCGCAAC
AGTCGAAGCA CGCCAGAGTC AGTCATTTTC GACGGCCTCG CAGTCGCATA TGCGGTCCGG
CTGGTTCATT GGGCGCGGAT GCTATCAGTA CTCACTGGGT ACCACTCGGG GTGGCTGTTC
GGTATCGCAG CAACCGGCCT CGAAGGCCGC CGCAGTCTTG TCTGGGCTCA GAGATTTCCA
CCCCGAGGCC CGCACTACGA CCAAGACAGC TACCGGCGAA CGACAACGGC GACACTCACC
GAAATGACCG AACATCCGGG AGTCGTAGCT AAAAGGCTCA TTGGAGCATT ACTCCGCGGA
CTCGGGACGA CCGAAGAGTT TCAGGAGGCG TTCAGCCAGC CACAGAATTG A
 
Protein sequence
MLTETHHLDI KREIGVKPGD RAELARDLAQ FAIDGGAVIV GVEEDKATRT WTLTPQKLVG 
LQERIEQIAN SQIDPPLSIA IRVLALAEDP TQGYVVIQVP VSPRAPHMVG GIYYGRGETR
RNRLSDAEVT RYHAARRDTT SRIEDLLQAE IDRDPIIAAG QARRGHLYLV AQPEIDHDEL
ALTLLESAQG PMAEPYRTIT AGAEQFVHQS VRIYEPSPGY ASSIAIRSTG RAYCSQTLSD
GRRYRPDGDF ESDTVDIEVH EDGGIRAMVG RMTEEYAGRN SRSTPESVIF DGLAVAYAVR
LVHWARMLSV LTGYHSGWLF GIAATGLEGR RSLVWAQRFP PRGPHYDQDS YRRTTTATLT
EMTEHPGVVA KRLIGALLRG LGTTEEFQEA FSQPQN