Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2803 |
Symbol | |
ID | 9246654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3348055 |
End bp | 3349380 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003680721 |
Protein GI | 297561747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.168707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0979616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAGCCC AGCGGCCGGG AGTGTGGGCG CTCCTGAACA AGTGGCTCTC CGCACGCAGA GCCCGCGCCG AGCGCCTCGC CAGGCTGGAG GCGGAGAACG AGCGCTCCCG TGCCGAGCCC CAGGCCGCGG ACGAGGGGCC GACCGAGCAG CACCAGGGAG ACGACAACCT CCTGCGGTCC ATCAGCGACG TGGCCTGGCG GGTACTGCTC ATCGGCGTGG TGGCGGGCCT GCTCGTCTAC GTGCTCGTCT ACCTGTCGGT CGTCACGCTG CCGGTGATCC TGGCGGTGTT CCTCACCGCC CTGCTCATGC CGATCGCCAA CGGGCTGCGC CGCAAGGGGC TGGGCAGGGG GCTGTCGACC ACCATCGCCC TGCTGGTCGG ACTCATCGTC TTCGGCGGCG TGATCTCGCT GATCGTCACG CCCGCGATCC AGGGCTTCGG TCCGCTGGTG GACAGCGTCA CCAGCGCGAT CACCGAGCTC CAGGACATCC GGCTGCCCTT CGTCGACCCG GCCCTGTTCA CCGACATGAT CGACGACGCC TGGGCGCAGA TCCAGAGCAT GATCACCGAG AACCAGGACC AGCTGCTCAG CGGCGCCTGG ACCGCCACCT CGGCGGTGAT CTCGGTCCTG GTCGGCATCG TCCTGATCAT CGCCCTGACC GTGTACTTCG TGCACTCGGG CGACCAGCTC ATGGACTGGC TGGTCACCCT GCTGCCGGCC CGCTCGCGCC CGGGCATGCG CCACGCGGGC GACGTCGCCT ACGGGGTCAT GGGGCGCTAC GTGCGGGGCG TGGCCGCGGT CGGCTTCTTC GACGCCGTCG GTATCGGTAT CGCCCTGGTC ATCTTCCTCG ACATCAACCT GGCCATCCCG CTGATCGTGC TGACCTTCGT CGGGGCCTTC CTGCCGATCA TCGGCGCCTT CCTCACCGGC CTGCTCGCCG CCCTGGTGGC CTTCGTGACC GAGGGCTGGG TCGTGGCCCT GATCATCGTC GGCGCCGTGC TCCTGGTGCA GCAGCTGGAG AGCAACGTCT TCGCGCCGCG CATCTACGGC GCCTCGCTCG ACCTGCCCTC GCCGGTCGTG CTCATCGGGA TCTCCGTCGG CGCGGTCGTC GGCGGTATCC CCGGCATGTT CCTGTCCACC CCGGTGGTCG CCGTGCTGGC CGCGCTGCTG CGCAACCGCC CGCCCTCCAG CGGTGACGAC TCCGGCGGAG GGGACGCGGA CGTGGCCGAG GTCAAGGCGG ACACCGTCGT GGTCAGGGCC GACCAGGGGA CCGGCCAGGA CGCCTCCGGC GGAGCCACCG CGGTCGATCC CCCCGAACAG AAGTAG
|
Protein sequence | MQAQRPGVWA LLNKWLSARR ARAERLARLE AENERSRAEP QAADEGPTEQ HQGDDNLLRS ISDVAWRVLL IGVVAGLLVY VLVYLSVVTL PVILAVFLTA LLMPIANGLR RKGLGRGLST TIALLVGLIV FGGVISLIVT PAIQGFGPLV DSVTSAITEL QDIRLPFVDP ALFTDMIDDA WAQIQSMITE NQDQLLSGAW TATSAVISVL VGIVLIIALT VYFVHSGDQL MDWLVTLLPA RSRPGMRHAG DVAYGVMGRY VRGVAAVGFF DAVGIGIALV IFLDINLAIP LIVLTFVGAF LPIIGAFLTG LLAALVAFVT EGWVVALIIV GAVLLVQQLE SNVFAPRIYG ASLDLPSPVV LIGISVGAVV GGIPGMFLST PVVAVLAALL RNRPPSSGDD SGGGDADVAE VKADTVVVRA DQGTGQDASG GATAVDPPEQ K
|
| |