Gene Information |
Locus tag | Ndas_2641 |
Symbol | |
ID | 9246492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3150277 |
End bp | 3151503 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003680564 |
Protein GI | 297561590 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
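The coordinate and length fields in the table above agree with one another, which a little arithmetic confirms. The following is a minimal sketch in Python. It assumes that Start bp and End bp are 1-based inclusive coordinates (the convention under which the listed numbers match) and that the last codon of this unspliced CDS is a stop codon not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3150277, 3151503)
print(length)                  # 1227
print(protein_length(length))  # 408
```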
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.517547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.208528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCACACGA GCACCACACC GCCCGGGTCC GCGCGCCCCG ACCGCGGCGG CGGCATGCCG CGCTGGCTCC CCCGGGCCAT GCTGCTGGCC CTGTGGCTGG TCACCGCGTT CGGCCTCACC CTGTGGCTGT TCGTCCGGTT GCAGAGCCTC ATCATGCTGC TGCTGATCTC GCTGTTCCTC GCCCTGGCGC TGGAACCGGC GGTCAACTGG CTCCACCGGC ACCGCTGGCC GCGCGGGCCC GCCACCGGGC TGGTGATGCT GCTGGTACTG GCGCTGACCG TGGTGTTCCT CAGTCTGCTC GGGTCGATGC TGGTCGGCCA GATCCTGGCC TTCGTCTCCG AGATCCCCGC GATGATCCGC GCCGCGCTGG CCTGGGTCAA CACCACGTTC GACACCTCCT ACTCCCCCAC CACCCTGCTC AACGAGATCT CCAGCGCCAG CGGGCTGATC GAGCAGTACG CCTCCGGTAT CGCCAACAAC GTCTGGGGCG CCGGGACGAC CGTCCTGGCG CTGCTGTTCA ACGCGCTGAC GATCGCGCTC TTCACCTTCT ACCTGTGCGC CGACGGCCCG CGCTTCCGCC GCGTGATCTG CTCGGTCCTG CCGCCGCGCA CCCAGCGCGA GGTGCTGCGG GCCTGGGAGA TCGCGATCAC CAAGACGGGC GGTTACCTCT ACTCCCGTGC GCTGCTGGCC CTGGTCTGCT CGGGCGCGCA CTACGTGGTG CTGGTCGCGC TGGACATCCC CTTCGCGTTC GCCCTGGCGC TGTGGGTGGG CGTGCTGTCG CAGTTCATCC CCACCGTGGG CACCTACATC GGCGGGGTCG TCCCGGTGCT CGTGGCGCTG ATGGAGGGCA TCTGGCCCGC CGTGTGGGTG CTGGTGTTCA TCGTCGTCTA CCAGCAGTTC GAGAACTACC TGCTCCAGCC GCGCATCACC GCCAGGACCC TGGACATGCA CCCGGCGGTG GCGTTCGGCT CGGTCCTGGC GGGCGTGGCC ATCCTGGGGG CGCCCGGCGC GCTGCTCGCG CTGCCGATGG GCGCGAGCAT GCAGGCGTTC CTGGGGACCT ACATCCGGCG CTACGAGGTG GCCGAGCACC CCCTGCTCTC CGACGCCGAG GAGGACGGGA AGGGCGGGAA ACCCTCCCCG GATCCGGTGG CCCCGGTCTC CGAGGGCGAC GGGGGCGACG CGCGTCCCCC CGGTCCGCGG GAGCGGGAGG GCGGGGAGGG ACCGTGA
|
Protein sequence | MHTSTTPPGS ARPDRGGGMP RWLPRAMLLA LWLVTAFGLT LWLFVRLQSL IMLLLISLFL ALALEPAVNW LHRHRWPRGP ATGLVMLLVL ALTVVFLSLL GSMLVGQILA FVSEIPAMIR AALAWVNTTF DTSYSPTTLL NEISSASGLI EQYASGIANN VWGAGTTVLA LLFNALTIAL FTFYLCADGP RFRRVICSVL PPRTQREVLR AWEIAITKTG GYLYSRALLA LVCSGAHYVV LVALDIPFAF ALALWVGVLS QFIPTVGTYI GGVVPVLVAL MEGIWPAVWV LVFIVVYQQF ENYLLQPRIT ARTLDMHPAV AFGSVLAGVA ILGAPGALLA LPMGASMQAF LGTYIRRYEV AEHPLLSDAE EDGKGGKPSP DPVAPVSEGD GGDARPPGPR EREGGEGP
|
| |
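The sequence fields above are printed in space-separated blocks of ten characters, so the blocks have to be joined before any computation such as a GC-content check. The following is a minimal sketch in Python with hypothetical helper names. The demonstration string is only the first three blocks of the gene sequence, so the fraction it yields describes that 30-base prefix and not the 71% reported for the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace between blocks and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, copied from the record above.
prefix = join_blocks("TTGCACACGA GCACCACACC GCCCGGGTCC")
print(len(prefix))          # 30
print(gc_fraction(prefix))
```

Passing the full Gene sequence field through the same two helpers is one way to compare the result against the GC content row of the record.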