Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4043 |
Symbol | |
ID | 8449662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4456722 |
End bp | 4457945 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043088 |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003203324 |
Protein GI | 258654168 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.392322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.322652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCCA TCCTGTTTGC ATCCGTTCCC GTGCACGGCC ACGTCACACC GCTGCTGTCC GCCGCCGGGC ATTTCGTGGC TCGCGGAGAC CGGGTTCGTT TCCTGACCGG GTCCCGGTTT GCCGGGGTCG TGCAAGCGAC CGGCGCGCAG CACCTCCCAC TGCCGCCGGA GGCCGACTTC GACGACCGGC AGGACCTCGC CGAAACATTC CCCGAGCGCG CACGACTCAC CGGGGCCAAG TCCATCGCCT TCGACATCGA GCACGTCTTC GTGCGTCCCG GTCGCGCCCA GCACGACGCG ATCATGAGCC TGCACCGGGA GGAACCCGCG GACGTGGTCC TGGTCGACAC CGCGTTCGTA GGTGGTGCAT TCCTGCTGGG TCACCCGTTG CGCGACCGTC CACCGATCGT GGTCGGCGGC GTGGTTCCGT TGACCATCAG CAGTCGCGAA ACCGCGCCCT ACGGCATGGG TTTGACGCCG ATGCGCGGGC CCCTCGGCCG GTTGCGCAAT TCGGTGCTGA GAAAGATCGC GGCGCGTACC GTCTTCCCGC CGGCCGAGCG CGTGGCCGAC GAGGTCCACG ACACGCTGTT CGGGCGCCCA CTGCCGTTTC CGGTCCTGGA CTGGCCGCGG CATGCGGAGG CGATCGCGCA GTTCACCGTT CCGGAGTTCG AGTATCCGCG CTCCGACGCG CCGGCCGGCC TGCATTTCGT CGGCCCGATC TCGGCCACCG GCTCGCGGGC GGACCCACCG CCCTGGTGGG ACGAGCTGGA CGGGTCCCGG CCCGTCATTC ACGTCACCCA GGGAACGATC GCAAACCGTG ACTACGACCA GATCATCGCC CCCACGCTCA CGGCACTGGC CGGCCAAGAC CTCCTGGTCG TCGTCGCCAC GGGCGGGCGC CCCGTGGACT CCCTCCCGCC GCTGCCGGCG AACGCGCGCG CGGCGACGTT CCTGCCCTAC GACTCGCTCC TGCCCAAGAC CGATGTCTTC GTGACCAACG GCGGCTACGG CGGCGTCCAG TACGCCCTTC GCTATGGGGT CCCGGTCATC ACCACCAGCG GTCACGAGGA CAAGCCCGAG GTGGCTGCCC GAATAGCCTG GTCCGGCGCC GGCCGGCGGT TGAAACCACC AGGCCCACCC CCGCCGCGGT CGCCGCCGCC GTCCGTTCGG TGCTCGAGGA TCCCGGCTAC CGCGCCCGCG CGCAGGCCAT TGCGGCGAGC ATGA
|
Protein sequence | MASILFASVP VHGHVTPLLS AAGHFVARGD RVRFLTGSRF AGVVQATGAQ HLPLPPEADF DDRQDLAETF PERARLTGAK SIAFDIEHVF VRPGRAQHDA IMSLHREEPA DVVLVDTAFV GGAFLLGHPL RDRPPIVVGG VVPLTISSRE TAPYGMGLTP MRGPLGRLRN SVLRKIAART VFPPAERVAD EVHDTLFGRP LPFPVLDWPR HAEAIAQFTV PEFEYPRSDA PAGLHFVGPI SATGSRADPP PWWDELDGSR PVIHVTQGTI ANRDYDQIIA PTLTALAGQD LLVVVATGGR PVDSLPPLPA NARAATFLPY DSLLPKTDVF VTNGGYGGVQ YALRYGVPVI TTSGHEDKPE VAARIAWSGA GRRLKPPGPP PPRSPPPSVR CSRIPATAPA RRPLRRA
|
| |