Gene Information |
Locus tag | Namu_4445 |
Symbol | |
ID | 8450072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4930069 |
End bp | 4931334 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043492 |
Product | Protein of unknown function DUF1972 |
Protein accession | YP_003203720 |
Protein GI | 258654564 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
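The length fields above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1266 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4930069
END_BP = 4931334


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp):
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1266
print(protein_length(length_bp))  # 421
```

Both results agree with the Gene Length and Protein Length fields of this record.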
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.733886 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATGA AGTCGAAGCA GGCCCGCCCG CTGCGGATCG CGCTGGTGGG AACCCGCGGG GTGCCGGCCC GGTACGGCGG GTTCGAGACC TGTGTCGAGG AGGTCGGCTC CCGGCTCGTC GAGCGTGGCC ATGAGGTGGT CGTCTACTGC CGGCGCCGCG GCTCGGATCG CAGCGCGGAG CTGGACAGCT ACAAGGGCAT GTCCCTGGTG CACTTCGGGG CCCTGAAGAA GCGCTCGCTG GAGACCTTGA GCCACACGGC GCTTTCGGTG CAGCACCTGG TCCGCCATCG GACCGACGCG GCGGTGGTCT TCAACGCGGC CAACGCGCCC TTCCTGCCGG CTCTGCGGGC GGCCCGGATC CCCGTGGCCA CCCACGTGGA CGGGCTGGAA TGGAAGCGCG ACAAGTGGGG CGGCGCCGGC CGTCGTTACT ACCTGATGGC CGAGCGACTG GCGGTCAAGT GGTCGGACGC GCTGATCGCC GACGCGGTCG GCATCCAGGA CTACTACCTG GACAAGTTCG CCATGCCCAC CGATCTGATC ACCTACGGCG CCCCGATCCT GGACACCGTC GGCGACCACC GGCTGGCCGA GCTCGGCCTG ACCTCGGGCG GATACCACCT GGTGGTGGCC CGATTCGAGC CGGAGAACCA CGTGGACATG ATCGTGGAGG GCTACTCGGC CAGTGCGGCC GAGCTCCCGC TGATCGTCGT CGGGTCCGCG CCGTACGCGG ACGCCTACAC CCAGCGAGTG CACGAACTGG CCGATGGCCG GGTGCGGTTC CTCGGTGGGG TGTGGGACCA GCAGCTGCTG GACCAGCTCT ACGCCAACGC CTTCACCTAC CTGCACGGGC ATTCGGTGGG TGGGACCAAC CCCTCGCTGC TGCGGGCACT CGGGGCCTCG GCCGCGACCA CTGCGTTCGA CGTCAACTTC AACCGTGAGG TGCTGGGCGG GGCCGGTCGG TTCTTCTCCG ACGTGGCCGG GGTTCGCGCG CAGATCGAGG CCTCCGAACT GGACATCGCC AGCACCGTCG AACTGGGTAC CCAGGCTCGG ATCCAGGCGA CCAAGTACGA CTGGGACGAT GTGACCGACC GCTACGAGGA CCTGTGCCTT CGCCTGGCCG GACGCGATCG GGCGTTGGCC GGTCCTCGGA CCGACGCGGT GGCCGCGGCC CCGCTCGATG CCTGGCTGGC GGAGATCGGC ATCGCCGGGT CGGCGGAGCC GGTGCGCGTG CGGACCGGCT CGGAGCCCTC GCTCCGATCG GCATGA
|
Protein sequence | MIMKSKQARP LRIALVGTRG VPARYGGFET CVEEVGSRLV ERGHEVVVYC RRRGSDRSAE LDSYKGMSLV HFGALKKRSL ETLSHTALSV QHLVRHRTDA AVVFNAANAP FLPALRAARI PVATHVDGLE WKRDKWGGAG RRYYLMAERL AVKWSDALIA DAVGIQDYYL DKFAMPTDLI TYGAPILDTV GDHRLAELGL TSGGYHLVVA RFEPENHVDM IVEGYSASAA ELPLIVVGSA PYADAYTQRV HELADGRVRF LGGVWDQQLL DQLYANAFTY LHGHSVGGTN PSLLRALGAS AATTAFDVNF NREVLGGAGR FFSDVAGVRA QIEASELDIA STVELGTQAR IQATKYDWDD VTDRYEDLCL RLAGRDRALA GPRTDAVAAA PLDAWLAEIG IAGSAEPVRV RTGSEPSLRS A
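The protein sequence can be re-derived from the gene sequence. The following is a minimal sketch, assuming the gene sequence is listed 5' to 3' on the coding strand (it starts with ATG and ends with TGA even though the gene lies on the minus strand) and that translation table 11 assigns codons to amino acids in the same way as the standard code. `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    A codon containing anything other than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATCATGA AGTCGAAGCA GGCCCGCCCG"))  # MIMKSKQARP
```

The output matches the first block of the protein sequence. Passing the full gene sequence should give the complete 421-residue protein under the same assumptions.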