Gene Namu_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4043 
Symbol 
ID8449662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4456722 
End bp4457945 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content70% 
IMG OID645043088 
Productprotein of unknown function DUF1205 
Protein accessionYP_003203324 
Protein GI258654168 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.392322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.322652 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCCA TCCTGTTTGC ATCCGTTCCC GTGCACGGCC ACGTCACACC GCTGCTGTCC 
GCCGCCGGGC ATTTCGTGGC TCGCGGAGAC CGGGTTCGTT TCCTGACCGG GTCCCGGTTT
GCCGGGGTCG TGCAAGCGAC CGGCGCGCAG CACCTCCCAC TGCCGCCGGA GGCCGACTTC
GACGACCGGC AGGACCTCGC CGAAACATTC CCCGAGCGCG CACGACTCAC CGGGGCCAAG
TCCATCGCCT TCGACATCGA GCACGTCTTC GTGCGTCCCG GTCGCGCCCA GCACGACGCG
ATCATGAGCC TGCACCGGGA GGAACCCGCG GACGTGGTCC TGGTCGACAC CGCGTTCGTA
GGTGGTGCAT TCCTGCTGGG TCACCCGTTG CGCGACCGTC CACCGATCGT GGTCGGCGGC
GTGGTTCCGT TGACCATCAG CAGTCGCGAA ACCGCGCCCT ACGGCATGGG TTTGACGCCG
ATGCGCGGGC CCCTCGGCCG GTTGCGCAAT TCGGTGCTGA GAAAGATCGC GGCGCGTACC
GTCTTCCCGC CGGCCGAGCG CGTGGCCGAC GAGGTCCACG ACACGCTGTT CGGGCGCCCA
CTGCCGTTTC CGGTCCTGGA CTGGCCGCGG CATGCGGAGG CGATCGCGCA GTTCACCGTT
CCGGAGTTCG AGTATCCGCG CTCCGACGCG CCGGCCGGCC TGCATTTCGT CGGCCCGATC
TCGGCCACCG GCTCGCGGGC GGACCCACCG CCCTGGTGGG ACGAGCTGGA CGGGTCCCGG
CCCGTCATTC ACGTCACCCA GGGAACGATC GCAAACCGTG ACTACGACCA GATCATCGCC
CCCACGCTCA CGGCACTGGC CGGCCAAGAC CTCCTGGTCG TCGTCGCCAC GGGCGGGCGC
CCCGTGGACT CCCTCCCGCC GCTGCCGGCG AACGCGCGCG CGGCGACGTT CCTGCCCTAC
GACTCGCTCC TGCCCAAGAC CGATGTCTTC GTGACCAACG GCGGCTACGG CGGCGTCCAG
TACGCCCTTC GCTATGGGGT CCCGGTCATC ACCACCAGCG GTCACGAGGA CAAGCCCGAG
GTGGCTGCCC GAATAGCCTG GTCCGGCGCC GGCCGGCGGT TGAAACCACC AGGCCCACCC
CCGCCGCGGT CGCCGCCGCC GTCCGTTCGG TGCTCGAGGA TCCCGGCTAC CGCGCCCGCG
CGCAGGCCAT TGCGGCGAGC ATGA
 
Protein sequence
MASILFASVP VHGHVTPLLS AAGHFVARGD RVRFLTGSRF AGVVQATGAQ HLPLPPEADF 
DDRQDLAETF PERARLTGAK SIAFDIEHVF VRPGRAQHDA IMSLHREEPA DVVLVDTAFV
GGAFLLGHPL RDRPPIVVGG VVPLTISSRE TAPYGMGLTP MRGPLGRLRN SVLRKIAART
VFPPAERVAD EVHDTLFGRP LPFPVLDWPR HAEAIAQFTV PEFEYPRSDA PAGLHFVGPI
SATGSRADPP PWWDELDGSR PVIHVTQGTI ANRDYDQIIA PTLTALAGQD LLVVVATGGR
PVDSLPPLPA NARAATFLPY DSLLPKTDVF VTNGGYGGVQ YALRYGVPVI TTSGHEDKPE
VAARIAWSGA GRRLKPPGPP PPRSPPPSVR CSRIPATAPA RRPLRRA