Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3594 |
Symbol | |
ID | 9247463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4307776 |
End bp | 4308789 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | NusA antitermination factor |
Protein accession | YP_003681500 |
Protein GI | 297562526 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTC TGCGCAGCCT GGAACGCGAG AAGGACATCT CCGTCGAACT CGTGGCCAAG GCCATCGAGG ACGCGCTTCT GATCGCCTAC CACCGCGAGG AGGGCCCGGA GAAGCGGGCC CGCGTCGAGC TCAACCGCTC CACCGGCCAC GTGACCGTGT GGGTGGCCGA GACCGACGAG GACGGCGAGT CCGTCGGCGA GTTCGACGGC ACCCCCACCG GGTTCGGCCG CATCGCGACC TCCACCGCCA AGCAGGTCAT CCTCCAGCGT CTGCGCGACG CCGAGGACGA GCTGACCCTG GGCGAGTTCG CCGGGCGCGA GCACGACATC GTCTCCGGCA TCATCCAGCA GGGCAAGGAC CCCCGCAACG TGCTGGTGGA CCTCGGCAAG ATCGAGGCGG TCCTGCCCCC GCAGGAGCAG GTGCCCACCG AGACCTACAC CCACGGCGAG CGCCTGCGCG CCTACGTGGT GCAGGTCCGC AAGGGCCACC GCGGCCCCTC GGTCACCCTC TCGCGCACGC ACCCCAACCT GGTGCGCAAG CTCTTCGAGC TGGAGGTCCC CGAGATCGCC GACGGCACCG TGGAGATCGC GGCGATCGCC CGGGAGGCGG GCCACCGCAC CAAGATGGCC GTCCACTCCA ACCGCGGCGG GGTCAACGCC AAGGGCGCCT GCATCGGCCC GCTCGGCAGC CGCGTCCGCA ACGTCATGGC CGAACTCCAC GGGGAGAAGA TCGACATCGT CGACTACTCC GAGGACCCGG CCGTGTTCGT CGCCAACGCC CTGTCCCCGG CGCGGGTCAA CTCCGTGGAA ATCCTCGACA TGGCCTCCCG CGTCGCCCGC GTGATCGTGC CCGACTACCA GCAGTCCCTC GCGATCGGCA AGGAGGGCCA GAACGCCCGT CTGGCCGCAC GTCTGACCGG CTGGCGCATC GACATCCGCT CCGACGCCGA GCCCGCCGGT GAGCCGGAAC GGGAGGGCGC CGCCGAGGGT GACGACACCG CCGCGACGGG CTGA
|
Protein sequence | MSVLRSLERE KDISVELVAK AIEDALLIAY HREEGPEKRA RVELNRSTGH VTVWVAETDE DGESVGEFDG TPTGFGRIAT STAKQVILQR LRDAEDELTL GEFAGREHDI VSGIIQQGKD PRNVLVDLGK IEAVLPPQEQ VPTETYTHGE RLRAYVVQVR KGHRGPSVTL SRTHPNLVRK LFELEVPEIA DGTVEIAAIA REAGHRTKMA VHSNRGGVNA KGACIGPLGS RVRNVMAELH GEKIDIVDYS EDPAVFVANA LSPARVNSVE ILDMASRVAR VIVPDYQQSL AIGKEGQNAR LAARLTGWRI DIRSDAEPAG EPEREGAAEG DDTAATG
|
| |