Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3508 |
Symbol | |
ID | 9247377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4213543 |
End bp | 4216059 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003681415 |
Protein GI | 297562441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.40387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATGG TTCTGCGCGG CGCCGCCGAC CGCGCGCGCC AACTGGCCCT GTCGGTCCTG ACCGTGGCCC TGGGCGCCGG GCTGGCCACG GCCGTCCTCG CCCTCCAGGA CTCGGCCGAG CGGGTGGCCG CCGGGGGAGC CGGGGCCTCC TGGACGCTGT CACGGGCACC GGTCGTGGTG ACGGCCGTCC CCGAGGAGGC CGCGGCCGGG ATCACCGCCT CCCCGCTGGG GGAACCGCCC CGGCTGGACC CGGATACCGT AGCGGAACTG GAGCGCCTCC CCGGTGTGCG CCGGACGGCG GTCGAGGCGC CCTTCTCCGC CTACGTGGTC ACCGCCGACC GTACGCTCGG CGGCCACTCC GACCGCTCCT TCGGCCACTC ATGGGCGCTC GCCGAGGCCG AGGGGCTCAC CCCCGCCACC GGGCGGGCCC CCGAGAGCGT GGGCGAGGTG GTCCTCGACA CCCGCACCGC CGCCGACGCC GGGCTCACCC CCGGCGACCG TGCGCGGGTC CTGACCTCCG ACGGGACCGC CGACGTGCTG GTCACCGGAA CCGTCGAGCG CGGCGGCGCC CCGGACCGGG CCCTGTTCTT CCCACCCGTC GAGGCCGCGC GTCGGGGCGG GGATCCGGTC CTGGCCCTGC TCTGGCCCGG GGAGGGAACC GGCCCCGACC GGCTGGCCGG GGCGGTCGAG GAGGCCGCGC CCGGCGCCCG GGTGCTCACC GGGGATGAAC GCTCCACGGC GCTGGCCCTG GACGGGGAGA ACCGGGACCT GGCCTCGGGC ATGGGCCGGT TCCTGGGCAC CATGGCCGCG CTGGCGCTGG CGGTGGCCGC CGTCACGGTC GCGGGCCTGC TCTCCCTGAC CGTGCGCGAC CGCGCCCGGG AGTTCGCCCT CCTGCGCCTG GCCGGGGCCC GGCCGGGGCT GGTGCGCCGA CTCGTCGTCG GGGAGGCCCT GGTCCTGGGG TGCGTGGCCG CCGCGCTCTC CTGCGTCGTG GGTACGGCGC TGGCGCTGCT CCTGAGCAGG CTCTTCGCGG AGCTGGGAGC GCTGCCGGAC GGGTTCGCGC TGGTCCTGGG CTGGCCCCCG CTCGCGGCGG GCGCGGCGCT GGCCCTGGCC GTGCCGCTGG CGGCCTCCTG GCGCCCCGCG CTCACCGCGG GCCGGATCGC GCCCGTGGAG GCGATGCGCG CGGCGCAGGC CGAACCCGTC TCCTTCTCCC GTGCCCGCCC GGTGCTCGGC ACCGTGGTCC TGTGCGGCGC GGCGGCCCTG TTCGCGACGG CCTGGGGCCT GGCCGGAACC GTGGTGGCGG TGACCGCCGC CGCCACGGCC GCGCTGGTCC TCGTCGCCGC CGCGGTCCTG CTCTCCCCGG TGCTGGTCCA CGCGGTGCTG CTCCTGCTGC GCCCGCTCAC CCGCAGGAGG GCCGCCTCCC TGGTCGCCGA CCGGGAGGCG CGCGCCGACG TCCGCCGGGT GGCGGGCGTC ATGACGCCCC TGCTGGTGAC CACGGCGGTC GCCTGCCTGC TGCTCTTCCA GGAGACCACC ACCACCGAGG CCCGCCTTCG CGCCTACGGG GAGCGCCTGG CCGCCGACCT GGTGGTGTCC GGGGCCCTGG GCGTGGGCCT GCCCGCGTCC GCGGCCGAGG CCGCCGAGGG CGTGCCCGGG GTCGCCGCCG CCGGGGGCTA CCGCCAGACG GTCACCTCCG CGGGCGGGCC GTACCTGACC ACCCACCTGG TCGAACCCGA GACGGTGCCG CGGATCTACG ACCTGGCGGT GGAGGGTGGC GCGTGGGAGG ACTTCGGCAC CGGCGGCGTC GCCGTGCGCG CCGACACCGC CCGGAGCCGG GGGTGGCGCG CGGGGCGGAC CGTGGAACTG CTCGGCCCCG ACGGTACCGG GTTCACGGCC CGGGTGTCGG TCCTGTACCG GGCGGGGCTC GACTTCCCCG ACGTCCTGCT GCCCGAGGAG GCCGTCGCGC CCCGGATGCT CGACACCCTG CACAACGGCC TGTACGTGGT GCTCGACCCC TCGGCCGACG CCGGGAGGAC GGCCTCCCTG CTGGAGGAGG CGATCGACGC CGGGCCCGAA CTCCGGGTCA GCGACCGCGC CGGACACATC GCGGACCAGG CCCGGCTCGG CCAGGAGGAC GCGTGGATCA CCCACCTCAT GGTGGCCCTG GTGGCGGGCT TCGCCGGGGT GAGCGCGGTC AACGCCCTCG TGGTCTCCGT CTCGGCCCGC GCGCGGAGCT TCGCCCTGCT GCGGCTGGTG GGGGCCTCGC GGGCCCAGGT CGCCGGGATG GTCGCGGGGG AGGCCCTGGC GGTGTCGCTG GCGGGGGTGG CGCTGGGCAC CGCCACGGCG CTGACCGGGG TGGCCGCCGT GGGCCACGCC CTCGTCGGCG GCGGGACGGT GGTGCTCGCC GTCCCCCTGG ACCAGTACCT GCCGCTGGCC GGTGCCGTCG TGGGCATCGG GCTGCTCGCG AGCCTGGTCC CGGCCGTCGC GGCCCTGCGC GCCCGCCCGC TGCACGCCGC CGGGTGA
|
Protein sequence | MRMVLRGAAD RARQLALSVL TVALGAGLAT AVLALQDSAE RVAAGGAGAS WTLSRAPVVV TAVPEEAAAG ITASPLGEPP RLDPDTVAEL ERLPGVRRTA VEAPFSAYVV TADRTLGGHS DRSFGHSWAL AEAEGLTPAT GRAPESVGEV VLDTRTAADA GLTPGDRARV LTSDGTADVL VTGTVERGGA PDRALFFPPV EAARRGGDPV LALLWPGEGT GPDRLAGAVE EAAPGARVLT GDERSTALAL DGENRDLASG MGRFLGTMAA LALAVAAVTV AGLLSLTVRD RAREFALLRL AGARPGLVRR LVVGEALVLG CVAAALSCVV GTALALLLSR LFAELGALPD GFALVLGWPP LAAGAALALA VPLAASWRPA LTAGRIAPVE AMRAAQAEPV SFSRARPVLG TVVLCGAAAL FATAWGLAGT VVAVTAAATA ALVLVAAAVL LSPVLVHAVL LLLRPLTRRR AASLVADREA RADVRRVAGV MTPLLVTTAV ACLLLFQETT TTEARLRAYG ERLAADLVVS GALGVGLPAS AAEAAEGVPG VAAAGGYRQT VTSAGGPYLT THLVEPETVP RIYDLAVEGG AWEDFGTGGV AVRADTARSR GWRAGRTVEL LGPDGTGFTA RVSVLYRAGL DFPDVLLPEE AVAPRMLDTL HNGLYVVLDP SADAGRTASL LEEAIDAGPE LRVSDRAGHI ADQARLGQED AWITHLMVAL VAGFAGVSAV NALVVSVSAR ARSFALLRLV GASRAQVAGM VAGEALAVSL AGVALGTATA LTGVAAVGHA LVGGGTVVLA VPLDQYLPLA GAVVGIGLLA SLVPAVAALR ARPLHAAG
|
| |