Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3985 |
Symbol | |
ID | 8449604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4399753 |
End bp | 4401774 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645043030 |
Product | protein of unknown function DUF349 |
Protein accession | YP_003203266 |
Protein GI | 258654110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0275004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0113989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACG ACGTCACGAC CACCGAAGGA CATTCTTCCG GAGAGCCGGC CGGCGCGGAC CGCCCGGACC CCACGCTCAG CGTGCCGGAG ACCCCGGATT CGCCGGACAT CCCCACCCCG GAACCGGGCG CCCCGGCGCC GGCTCCGGAG GTGCCGGACC TGCCCGAACT CCCCGACCTA CCGGACCTTC CCGACGCTCC CGCGCCCCCG TCGACCACGA CGCCCACCGA CGACGCCGCA CCCGCGGCCG GGACGGACGA GGGCTCGGCC GCGGGTGCGG ACGAGCTCGC CCACGCCGGA TCGAGCGGGC CCGTCCCGAT GCCGACGGTC GCCCCACGCA CGCCACCGGT CGCGATCCCG ACCGTGGCGC CGTCGATCGA ACCCGCGCCG ATGCCCTCCC CGATCACGAC CAGCACCCCG GCCGATGCGC CCGACCAGCC GACCGTCGAG CCCGCGACGG ATCAACCGGC ACCGGCCGAC CAGGCCGACG CTGCACCGGC CGAGCAGACC GACGCTGCAC CGGCCGACCA GGCCGACGCT GCACCGGCCG ACCAATCCGA CGACGCCGCG CCCGAGCCGG CGGCCGCTCC ACACCCGGAC GCCGACCCGG CCACCAGCAC GCCGGCTCCC GCGACCCCCG GGCAGCGGCC GGGCCAGCGT GGTCCTCGTC CCCGCGGCGG GCGTCCCGGG TCGCCGGGGT CGCGCCCCGG TGGCCGGCCG GGTGGCCCGG GCGGCGGCCG GCCCAGCCCG GTGCCGGCGA CGCCCACGCA CGCCCCGGCG CACTCGACGG TGGTCGAGCC GGTGGTCGAC AGCGTCGACC CGCACGAGTG GGGCCGCATC GACGACGACG GTGTCGTCTA CGTCCGCACG GCCGCCGGCG AGCGCGCGAT CGGCAACTGG CAGGCCGGTG ACGCGGAGGC CGGGCTGGCC CACTTCGGCC GCAAGTTCGA CGACTTCAAC ACCGAGATCG CTTTGCTGGA GGCGCGTCTG GCCTCCGGCA CGGGTGATCC CAAGGCCACG AAGGCGCAGG CCATCGCGCT GCGCGACCAG GTCGAGTCGC TGTCGGCGAT CGGTGATCTG GACAACGCGG CGGTCCGGCT GGAGATCGTG ATCGGCGTCG CCGACGCCGC CATCGCCGGC GCCTCCACCG CCCGGATCGC GGCCCGCAAC GCCGCGATCA AGGCCAAGGA GGACCTGTGC GCGGAGGCGG AGGCCCTGGC CGAGTCGACC CAGTGGAAGT CCACCGGCGA CCGGCTCAAG GCGATCGTGG ACGAGTGGCG CACCATCCGC GGCATCGACC GCAAGACCGA TGACGCGCTG TGGAAGCGGT TCGCCAAGGC CCGGGACACC TTCACCCGGC GCCGGGGCTC CCACTTCGCC GAACTGGACA AGCAGCGGGG CGCCGCCCGG GAGGCCAAGG AAGAGCTGAT CAAGCGGGCC GAGGCGCTCT CGGACGCCAG CGACTGGGGC GAGACGGCGG CCAAGTACCG CGCCCTGATG GAGGAGTGGA AGGCCACCGG CCGGGCTCCG CGCGACGTCG AGGACGCGCT GTGGGCGCGT TTCCGGGCCG CGCAGGAGAA GTTCTTCTCC CGCCGGAACA AGGTGTTCTC CGACCGGGAC GCCGAGTTCG CGGCCAACGC CGCGACCAAG GAAAAGCTGC TCACCGAGGC CGAGAAGATT GACCCGGCAG CCGGTCTGGA CCAGGCCAAG GCCAAGATGC GCTCGATCCA CGAGCGCTGG GAGGCGGCCG GCAAGGTCCC CCGGGAACGG ATCCGGGACC TCGATCAGCG CCTGAAGACG ATCGAGGACC GCATCAAGGC GGCCGAGGAT CGGCAGTGGC GGCGCACCGA CCCGGAGACC GACGCGCGGG TGGCGCAGTT CCGGGCCCGG GTCGAGTCGT TCCAGGCCCA GGCGGCCAAG GCCCGCGCGG CCGGGGACGA GCGCAAGGCC AAGCAGGCCG AGGCCCAGGC CAAGCAGTGG GAGGAATGGC TCAAGACCGC GCAGAGCGCG GTCGACCGTT AG
|
Protein sequence | MPDDVTTTEG HSSGEPAGAD RPDPTLSVPE TPDSPDIPTP EPGAPAPAPE VPDLPELPDL PDLPDAPAPP STTTPTDDAA PAAGTDEGSA AGADELAHAG SSGPVPMPTV APRTPPVAIP TVAPSIEPAP MPSPITTSTP ADAPDQPTVE PATDQPAPAD QADAAPAEQT DAAPADQADA APADQSDDAA PEPAAAPHPD ADPATSTPAP ATPGQRPGQR GPRPRGGRPG SPGSRPGGRP GGPGGGRPSP VPATPTHAPA HSTVVEPVVD SVDPHEWGRI DDDGVVYVRT AAGERAIGNW QAGDAEAGLA HFGRKFDDFN TEIALLEARL ASGTGDPKAT KAQAIALRDQ VESLSAIGDL DNAAVRLEIV IGVADAAIAG ASTARIAARN AAIKAKEDLC AEAEALAEST QWKSTGDRLK AIVDEWRTIR GIDRKTDDAL WKRFAKARDT FTRRRGSHFA ELDKQRGAAR EAKEELIKRA EALSDASDWG ETAAKYRALM EEWKATGRAP RDVEDALWAR FRAAQEKFFS RRNKVFSDRD AEFAANAATK EKLLTEAEKI DPAAGLDQAK AKMRSIHERW EAAGKVPRER IRDLDQRLKT IEDRIKAAED RQWRRTDPET DARVAQFRAR VESFQAQAAK ARAAGDERKA KQAEAQAKQW EEWLKTAQSA VDR
|
| |