Gene Namu_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1120 
Symbol 
ID8446716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1249133 
End bp1250227 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content71% 
IMG OID645040257 
Productprotein of unknown function DUF808 
Protein accessionYP_003200516 
Protein GI258651360 
COG category[S] Function unknown 
COG ID[COG2354] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGTC TGTTCGCCCT GCTGGACGAC GTCGCGGCGC TGGTAAAGCT GACGGCGTCT 
TCGCTCGACG ACATCGCCGG GGCGACGGGC CGGGCCAGTG TGAAGGCCGC CGGGGTGGTC
GTCGACGACA CCGCGGTCAC CCCGCGCTAC GTGCAGGGCC TCAAGCCCGA GCGTGAGCTG
TCGATCATCT GGCGCATCGC CAAGGGCTCG CTGCGCAACA AGCTGCTGAT CATCCTGCCG
GTCGCGCTGC TGCTGTCCCA GTTCGCGCCG TGGGCCCTGA CCCCGATCCT GATGGTCGGC
GGCACGTACC TGTGTTACGA GGGTGCGGAG AAGCTGTGGG AGAAGTTCTC CGGCCACGAG
GCGCAGGCCC AGGACCCGGA CGAGGTCGAG GCCGTCGACC CGGCCGAGCA CGAGAAGCGG
GTCGTCTCCT CGGCCACCCG CACCGACTTC ATCCTCTCCG CCGAGATCAT GGTCATCGCG
CTGGACGAGG TGGCCAGCGA GGGCTTCGTC GCCCGGGCCA TCATCCTGGC CATCGTCGCG
GTCCTGATCA CCGCGCTGGT CTACGGCGTC GTCGGCCTGA TCGTGAAGAT GGACGACGCC
GGGCTGGCCC TGGCCCGCAA GCCCAGGCGC GCGGTGGCCG GCTTCGGGCG CGGCCTGGTC
AAGGCCATGC CCATCGTGCT GAGCACCCTG TCCTGGGTCG GCGTGGTGGC CATGCTCTGG
GTCGGCGGGC ACATCCTGCT GGTCGGCATG GACGAGCTGG GCTTCCATCT GCTCTACGGC
TGGGTGCACC ACCTGGAAAC CGCTGTGCAC GACGCCACCG GCGGGGCCGG GGCCGCCCTG
GGCTGGGTGA CCAACACGTT CTTCTCGGCC GTCCTGGGCC TGCTGGTGGG CGCCATCGTG
GTCGCTGTGC TGCACGTGTT GCCGATCGGG CGCAAGACGG CCGGCCATGG CGCCGACGAC
GGGGCTGCGG GGCACGGGGC TGCGGGGCAC GGGGCCGGCC CGGCCACCCC CGATCCGGCC
ACCCCCGATC CGGCCACCCC CGATCCGGGC CCGTCCGAGC GGAGCACGCC CGATCCGGAT
GAGCCCAAGG GCTGA
 
Protein sequence
MSGLFALLDD VAALVKLTAS SLDDIAGATG RASVKAAGVV VDDTAVTPRY VQGLKPEREL 
SIIWRIAKGS LRNKLLIILP VALLLSQFAP WALTPILMVG GTYLCYEGAE KLWEKFSGHE
AQAQDPDEVE AVDPAEHEKR VVSSATRTDF ILSAEIMVIA LDEVASEGFV ARAIILAIVA
VLITALVYGV VGLIVKMDDA GLALARKPRR AVAGFGRGLV KAMPIVLSTL SWVGVVAMLW
VGGHILLVGM DELGFHLLYG WVHHLETAVH DATGGAGAAL GWVTNTFFSA VLGLLVGAIV
VAVLHVLPIG RKTAGHGADD GAAGHGAAGH GAGPATPDPA TPDPATPDPG PSERSTPDPD
EPKG