Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0334 |
Symbol | |
ID | 9244169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 410255 |
End bp | 411895 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | ATP synthase F1, alpha subunit |
Protein accession | YP_003678288 |
Protein GI | 297559314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.297297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAGC TGACGATCCG GCCGGACGAG ATCCGGGACG CGCTACAGCG CTTCGTCCAG TCGTACGAGC CTGAAGCCAC CCGCGCGGAA GAGGTCGGTA CCGTCACCTA CTCCGGTGAC GGCATCGCCC GCGTCGGTGG CCTTCCCTCG GCGATGGCGA ACGAGCTGCT CCAGTTCGAG GACGGCACCC TGGGCCTGGC CCAGAACCTC GAGATCGGTG AGATCGGTGT CGTCGTGCTG GGTGACTTCA CCAGCATCGA GGAGGGCCAG AAGGTGCGCC GCACCGGCCA GCTCCTCTCG GTGCCGGTGG GTGACAACTT CCTCGGCCGC GTGGTGGACC CCCTGGGCGC CCCCATCGAC GGCAAGGGTG AGATCGAGTC CACCGAACGC CGTGAGCTGG AGCTCCAGGC GGCGACCGTG ATGGAGCGCA AGCCCGTCCA CGAGCCGCTC CAGACCGGTA TCAAGGCGAT CGACTCGATG ACCCCGGTCG GCCGCGGCCA GCGCCAGCTG GTCATCGGCG ACCGCCAGAC CGGCAAGACC GCGGTCTGCA TCGACGCGAT CATCAACCAG AAGGCCAACT GGGAGTCGGG CGACCCCGAC AAGCAGGTGC GCTGCATCTA CGTCGCGATC GGCCAGAAGG GCTCGACCAT CGCCGGTGTG CGTGGCGCCC TCGAAGAGGC CGGCGCGATG GAGTACACCA CCATCGTCGC CGCCCCGGCG TCCGAGGCGG CCGGCTTCAA GTACCTGGCC CCCTACACCG GCTCGGCCCT GGGCCAGCAC TGGATGTACG AGGGCAAGCA CGTCCTCATC GTCTTCGACG ACCTCACCAA GCAGGCCGAG GCCTACCGTG CGGTGTCGCT GCTGCTGCGC CGCCCGCCGG GCCGCGAGGC CTACCCCGGT GACGTCTTCT ACCTGCACTC CCGGCTGCTG GAGCGCTGCG CCAAGCTCTC CGACGAGATG GGCAAGGGGT CGATGACCGC CCTGCCGATC ATCGAGACCA AGGCGGGCGA CGTCTCGGCG TACATCCCCA CCAACGTCAT CTCCATCACC GACGGCCAGG TCTTCCTGGA GTCGGACCTG TTCAACCAGG GCCAGCGCCC GGCGATCAAC GTCGGTGTGT CGGTCTCCCG TGTCGGTGGC GCCGCGCAGA CCAAGGCCAT GAAGAAGGTC TCGGGCACCC TGCGGCTGGG CCTGGCCCAG TACCGCGAGC TGGAGGCGTT CTCCGCCTTC GGTTCGGACC TGGACGCCGT CTCCAAGCAG CAGCTGGAGC GCGGTGCCCG CCTGATGGAG CTCCTCAAGC AGGGCCAGTA CTCGCCGTTC TCCATGGAGA AGCAGGTCGT CTCGATCTGG GCCGGCACCA CCGGCCGCGT CGACGACGTC CCGGTCGAGG ACGTGCGCCG CTTCGAGGAG GACTTCCTCG ACCACCTGAG CCGCGAGCAC CAGGGCATCC TCGACACCAT CCGCGAGAGC GGCAAGTTCG AGGACGAGAC CGAGAAGTCC CTCGACTCGG CGCTGGAGAA GTTCAAGCAG GGCTTCCAGA CCTCCGCCGG GACCCTCCTG GGCACCGAGG CCGAGGCCGA GGCGCTGGAC GAGGAGAAGG TCGGCCAGGA GACCATCAAG GTCGCCAAGG GCGGGAAGTA A
|
Protein sequence | MAELTIRPDE IRDALQRFVQ SYEPEATRAE EVGTVTYSGD GIARVGGLPS AMANELLQFE DGTLGLAQNL EIGEIGVVVL GDFTSIEEGQ KVRRTGQLLS VPVGDNFLGR VVDPLGAPID GKGEIESTER RELELQAATV MERKPVHEPL QTGIKAIDSM TPVGRGQRQL VIGDRQTGKT AVCIDAIINQ KANWESGDPD KQVRCIYVAI GQKGSTIAGV RGALEEAGAM EYTTIVAAPA SEAAGFKYLA PYTGSALGQH WMYEGKHVLI VFDDLTKQAE AYRAVSLLLR RPPGREAYPG DVFYLHSRLL ERCAKLSDEM GKGSMTALPI IETKAGDVSA YIPTNVISIT DGQVFLESDL FNQGQRPAIN VGVSVSRVGG AAQTKAMKKV SGTLRLGLAQ YRELEAFSAF GSDLDAVSKQ QLERGARLME LLKQGQYSPF SMEKQVVSIW AGTTGRVDDV PVEDVRRFEE DFLDHLSREH QGILDTIRES GKFEDETEKS LDSALEKFKQ GFQTSAGTLL GTEAEAEALD EEKVGQETIK VAKGGK
|
| |