Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0989 |
Symbol | |
ID | 9244835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1209058 |
End bp | 1210062 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Arabinan endo-1,5-alpha-L-arabinosidase |
Protein accession | YP_003678939 |
Protein GI | 297559965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0127699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAAC ACCCGCACAC CCTCTCCCGC CGCAGAATGC TGGGCTCCGC CCTGGCCGCG GGAGCGCTGG CCGCGACCAC GACCGCCGCC GACGCGGTCG GCGCCCGGCC CGCGGCGGCC GCCGACTACC CCGGACCCGG CCATGTGACC GGCGACGTCC GGGTGCACGA CCCGTCCTTC GTCAAGCGGC CCGGGGGCGG CTACCTGATC GCCCACACCG GCGACGACGT GGCCCTGAAG ACCTCCGCCG ACCGGACCGC CTTCCGCGAC GCGGGTCCGG CCTTCCCCGG CGGCGCCCCG TGGACCAGCC CCTACACGGG CGGCGCGCGC AACCTGTGGG CCCCGCACCT GTCCCACGCC GACGGGCGCT ACCGCCTCTA CTACTCCGCC TCGACCTTCG GCTCCAACCG GTCCGCGATC TTCCTGGCCA CCAGCACCAG CGGGGACTCG GGCACCTGGC GCGACGAGGG CCTGGTCATC GAGTCGTTCC CCTCCGACGA CTTCAACGCC ATCGACCCCC ATCTGGAGGT GGACGGCCAG GGCCGCTGGT GGCTGGCGTT CGGCTCGTTC TGGTCGGGGA TCAAGATGGT GCGCATCGAC CCGGCGACCG GCAAGCGCGG CGACCGCGTC CTGCACTCGA TCGCGGGCCG CGGCGGCGAC GCCGTCGAGG CGCCGACCCT CTTCCAGCGC GACGGCTGGT ACTACCTGTT CGTCTCCTTC GACCTGTGCT GCCGGGGAGC GGACAGCACC TACCGCATCA TGGTCGGCCG CTCCACCAGC ATCACCGGCC CCTATCGCGA CCGCGCCGGA AGGGCCATGA CCGCGGGCGG CGGCACCGAG GTCCTGTCCG GCCACGGCGG CGTGCACGGC CCCGGCCACC AGGACGTCTT CGCCGACACC GACCACGACA TCCTCGCCTA CCACTACTAC GCCGACGACG GCACGGCGCT GCTGGGCGTC AACTGGCTCG GCTGGGACTC CGCCGGATGG CCGTACGTGC ACTGA
|
Protein sequence | MTEHPHTLSR RRMLGSALAA GALAATTTAA DAVGARPAAA ADYPGPGHVT GDVRVHDPSF VKRPGGGYLI AHTGDDVALK TSADRTAFRD AGPAFPGGAP WTSPYTGGAR NLWAPHLSHA DGRYRLYYSA STFGSNRSAI FLATSTSGDS GTWRDEGLVI ESFPSDDFNA IDPHLEVDGQ GRWWLAFGSF WSGIKMVRID PATGKRGDRV LHSIAGRGGD AVEAPTLFQR DGWYYLFVSF DLCCRGADST YRIMVGRSTS ITGPYRDRAG RAMTAGGGTE VLSGHGGVHG PGHQDVFADT DHDILAYHYY ADDGTALLGV NWLGWDSAGW PYVH
|
| |