Gene Information |
Locus tag | Ndas_0001 |
Symbol | |
ID | 9248755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 308 |
End bp | 2266 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | chromosomal replication initiator protein DnaA |
Protein accession | YP_003677960 |
Protein GI | 297558986 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
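The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 308
end_bp = 2266
gene_length_bp = 1959
protein_length_aa = 652


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the values listed in the table.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2266 - 308 + 1 gives 1959 bp, and 1959 / 3 - 1 gives 652 aa, matching the Gene Length and Protein Length fields.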
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000505913 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00770539 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGCTGACG CGCAGGTCAA CCTCGCAGTG GTCTGGTCCA GCGTCCTGGA CGGGATCGAC AACGACTCCC TTCCCGCGCA CCAGCGCGCA TGGCTGCCGC AGACCCGTCC GCTGGGCCTG ATCGAGGACA CCGCGCTCAT CGCGGCGCCC AACGAGTTCA CCAAGAAGGT GCTGGAGACA CGCCTGTACC CGGCGATCAG CAAGGCGCTC TCCGCCCACC TGGGCCGGGA GATCCGGGTC GCGGTGACGG TCGACCCGAC CGCGGTGCCG ACCCCGCCGC CCACCCCGAC GCCGCCCGCC GCTCCGGGGC CGCAGGGCCC GCGCAGCGTG GAGGGGCACA CCGCGCCCGC CTTCTCCCCC GTCGTGCCGG GCACCGCCCC CGGCCCGACC CGGCCGACGG CTCCCCCGGC GGCGCCCGCG GACGCCCAGC GGGCCGTGAC CGGCTCCCGG GGACAGCACG CGCGTCCGGC AGCCGAGGCG GCCGAGCAGC CCGCACCGTG GCCCTCGGCC CCCGCTCCGG CCCCGGAGCA GGCCGACCTG CTGGGACCCA CGCCCCCGGC CGACCACCCC TCCCCCACCG ACGAGGCGCC GCACACGGTC CGCGACGCGC CCGCGCCCTG GGTCCAGCAG ACCTTCGCCT CCCCCCAGGA AACGCCCACC GCGCCCCCGC TGTGGGAGCA GCCCTCGGCG CTGGAGCGGC CGCAGGAGCC CGCGGGGTGG CGGCAGCAGT CCTGGACGGA ACCGGAGTGG GACCGGCCCA ACCGCTGGGA GACCCCGCGG GAGGAGGCGC CCGCGCCGGG GGCGGAGGCG GCCGAGGAGG CCGCGCAGCC CGCTCCGGAG GCCGTGGAGG CCCCTGTGGA GGACGAGGCC CCGAAGGCCC CCTCCTCTCC CCCGGGGCGC TCGCAGGAGG TCCCGCCGGG CGAGCACGCG CGCCTGAACC CGAAGTACAC CTTCGACACC TTCGTCATCG GCTCCAGCAA CCGCTTCGCG CACGCGGCGT CGGTGGCGGC CGCCGAGGCT CCGGCCAAGG CCTACAACCC GCTGTTCATC CACGGCGGTT CGGGGCTGGG CAAGACCCAC CTGCTGCACG CCATCGGGCA CTACACGCAC CGCCTCTACG AGGGTTCGCG GGTGCGGTAC GTCAGCTCGG AGGAGTTCAC CAACGAGTTC ATCAACTCGA TCCGGGACGG CAAGGCGGAC GGGTTCCGCC GCCGGTACCG CGACATCGAC GTGCTCCTGG TGGACGACAT CCAGTTCCTG GAGAACAAGG AGCAGACGCA GGAGGAGTTC TTCCACACCT TCAACACGCT GCACAACTCC GACAAGCAGA TCGTCATCTC CAGCGACCGG CCGCCCAAGC AGCTGACCAC GCTGGAGGAC CGGATGCGCA GCCGGTTCGA GTGGGGCCTG CTCACCGACG TCCAGCCGCC CGAGCTGGAG ACCCGGATCG CCATCCTGCG CAAGAAGGCC GCGCAGGAGG GGCTGGCGGC CCCTCCCGAG GTGCTGGAGT TCATCGCGAG CAAGATCTCC ACCAACATCC GGGAACTGGA GGGCGCGCTG ATCCGCGTGA CCGCCTTCGC CAGCCTGAAC CGGCAGTCGG TGGACCTGGA CCTGACCAGC CAGGTGCTGC GGGACCTGGT GCCCAGCACG GAGGTCCCCG AGGTCACGGC GGGGGCGATC ATGTCCCAGA CGGCCGCCTA CTTCGGCCTG ACGGTCGAGG ACCTGTGCGG GACCTCGCGC TCCCGGGTGC TGGTGACCGC CCGGCAGATC GCCATGTACC TGTGCCGGGA GCTGACCGAG 
CTGTCCCTGC CCAAGATCGG GCAGCAGTTC GGCCGCGACC ACACCACGGT CATGCACGCC GAGCGCAAGG TCCGCGGGCT GATGGCCGAG CGCCGGTCCA TCTACAACCA GGTGCACGAG CTCACCAGCC GGATCAAGGA CCAGCCCTCG CTGACCTGA
|
Protein sequence | MADAQVNLAV VWSSVLDGID NDSLPAHQRA WLPQTRPLGL IEDTALIAAP NEFTKKVLET RLYPAISKAL SAHLGREIRV AVTVDPTAVP TPPPTPTPPA APGPQGPRSV EGHTAPAFSP VVPGTAPGPT RPTAPPAAPA DAQRAVTGSR GQHARPAAEA AEQPAPWPSA PAPAPEQADL LGPTPPADHP SPTDEAPHTV RDAPAPWVQQ TFASPQETPT APPLWEQPSA LERPQEPAGW RQQSWTEPEW DRPNRWETPR EEAPAPGAEA AEEAAQPAPE AVEAPVEDEA PKAPSSPPGR SQEVPPGEHA RLNPKYTFDT FVIGSSNRFA HAASVAAAEA PAKAYNPLFI HGGSGLGKTH LLHAIGHYTH RLYEGSRVRY VSSEEFTNEF INSIRDGKAD GFRRRYRDID VLLVDDIQFL ENKEQTQEEF FHTFNTLHNS DKQIVISSDR PPKQLTTLED RMRSRFEWGL LTDVQPPELE TRIAILRKKA AQEGLAAPPE VLEFIASKIS TNIRELEGAL IRVTAFASLN RQSVDLDLTS QVLRDLVPST EVPEVTAGAI MSQTAAYFGL TVEDLCGTSR SRVLVTARQI AMYLCRELTE LSLPKIGQQF GRDHTTVMHA ERKVRGLMAE RRSIYNQVHE LTSRIKDQPS LT
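The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 treating TTG as an alternative start codon. The sketch below illustrates this on the first 30 bases of the gene sequence and shows one way to compute GC content from the space-grouped text. The helper names are hypothetical, and the start-codon set is an assumption based on the NCBI definition of table 11, not something stated in this record.

```python
# Minimal sketch; helper names are illustrative and not part of the source record.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumed initiation codons for translation table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon always yields methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
head = "TTGGCTGACG CGCAGGTCAA CCTCGCAGTG"
print(translate(head))  # MADAQVNLAV, the first 10 residues of the protein sequence
print(round(gc_content(head), 3))
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.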