Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1497 |
Symbol | |
ID | 9245347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1835057 |
End bp | 1836415 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF245 domain protein |
Protein accession | YP_003679433 |
Protein GI | 297560459 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00249472 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAACGAC GGATCTTCGG GCTGGAGAAC GAGTACGGGG TCACGTGCAC CTTCCGTGGA CAGCGCCGCC TGTCCCCGGA CGAGGTCGCC CGCTACCTCT TCCGCAGGGT GGTCTCCTGG GGGCGCAGCA GCAACGTGTT CCTGCGCAAC GGCGCCCGCC TGTACCTGGA CGTGGGCAGC CACCCCGAGT ACGCCACCCC CGAGTGCGAC AGTCTCGTGG ACCTGGTGGC CCACGACAAG GCCGGCGAGC GCATCCTGGA GGGGCTGCAG GTCGACGCCG AGCAGCGCCT GCACGAGGAG GGCATCGCCG GGGACATCTA CCTGTTCAAG AACAACACCG ACTCGGCGGG CAACTCCTAC GGCTGCCACG AGAACTACCT CGTGGGACGG CACGGCGAGT TCGGACGCCT GGCCGACGTG CTCATCCCCT TCCTGGTCAC CCGCCAGATC ATCTGCGGCG CCGGAAAGGT GCTCCAGACC CCCCGCGGCG CCCTGTTCTG CGTCAGCCAG CGCGCCGAGC ACATCTGGGA GGGCGTCTCC TCGGCCACCA CCCGCTCGCG CCCCATCATC AACACCCGCG ACGAGCCGCA CGCGGACGCC GAGCGCTTCC GGCGCCTGCA CGTCATCGTC GGCGACTCCA ACATGAGCGA GACCACCAAC CTGCTCAAGC TGGGCTCCAC CGACCTGGTG CTGCGCATGA TCGAGGCCGG GGTGGTCATG CGCGACTACA CGCTGGAGAA CCCGATCCGG GCCATCCGCG AGGTCAGCCA CGACATGACC GGCCGCCGCA AGGTACGCCT GGCCAACGGG CGCGAGGCCA GCGCGCTGGA GATCCAGCGC GAGTACCTGG ACAAGGTGCA GAGCTACGTC GACCGGCACG GCACCGACGC CACCGGCAAG CGCGTCCTGG AGCTGTGGCA GCGCACCCTG GAGGCGGTCG AGACCCAGAA CCTGGAGACC GTCTCCCGCG AGATCGACTG GGTGGCCAAG TACCTGCTGC TGGAGCGCTA CCGCGACAAG CACGACCTGT CCCTGTCCTC GCCGCGGGTG GCCCAGCTCG ACCTGACCTA CCACGACATC CACCGCGACC GGGGACTGTT CTACCTGCTC CAGGGCCGCG GCCAGATGGA ACGGGTGGTC GGCGACCTCA AGATCTTCGA GGCCAAGTCG GTGCCGCCGC AGACCACGCG GGCCCGGCTG CGCGGCGAGT TCATCCGGCG CGCCCAGGAG CAGCGCCGCG ACTTCACGGT GGACTGGGTG CACCTCAAGC TCAACGACCA GGCCCAGCGC ACGGTGCTGT GCAAGGACCC CTTCAAGTCG GTGGACGAGC GGGTGGAGAA GCTCATCGCC GGTATGTAG
|
Protein sequence | MERRIFGLEN EYGVTCTFRG QRRLSPDEVA RYLFRRVVSW GRSSNVFLRN GARLYLDVGS HPEYATPECD SLVDLVAHDK AGERILEGLQ VDAEQRLHEE GIAGDIYLFK NNTDSAGNSY GCHENYLVGR HGEFGRLADV LIPFLVTRQI ICGAGKVLQT PRGALFCVSQ RAEHIWEGVS SATTRSRPII NTRDEPHADA ERFRRLHVIV GDSNMSETTN LLKLGSTDLV LRMIEAGVVM RDYTLENPIR AIREVSHDMT GRRKVRLANG REASALEIQR EYLDKVQSYV DRHGTDATGK RVLELWQRTL EAVETQNLET VSREIDWVAK YLLLERYRDK HDLSLSSPRV AQLDLTYHDI HRDRGLFYLL QGRGQMERVV GDLKIFEAKS VPPQTTRARL RGEFIRRAQE QRRDFTVDWV HLKLNDQAQR TVLCKDPFKS VDERVEKLIA GM
|
| |