Gene Ndas_0334 details

Gene Information

Locus tag: Ndas_0334
Symbol:
ID: 9244169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 410255
End bp: 411895
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: ATP synthase F1, alpha subunit
Protein accession: YP_003678288
Protein GI: 297559314
COG category:
COG ID:
TIGRFAM ID:
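
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state but which the listed gene length agrees with. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 410255
end_bp = 411895

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop codon and is not
# part of the protein, so the protein is one residue shorter than the codon count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under that assumption the computed values match the record's Gene Length (1641 bp) and Protein Length (546 aa).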


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.297297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAGC TGACGATCCG GCCGGACGAG ATCCGGGACG CGCTACAGCG CTTCGTCCAG 
TCGTACGAGC CTGAAGCCAC CCGCGCGGAA GAGGTCGGTA CCGTCACCTA CTCCGGTGAC
GGCATCGCCC GCGTCGGTGG CCTTCCCTCG GCGATGGCGA ACGAGCTGCT CCAGTTCGAG
GACGGCACCC TGGGCCTGGC CCAGAACCTC GAGATCGGTG AGATCGGTGT CGTCGTGCTG
GGTGACTTCA CCAGCATCGA GGAGGGCCAG AAGGTGCGCC GCACCGGCCA GCTCCTCTCG
GTGCCGGTGG GTGACAACTT CCTCGGCCGC GTGGTGGACC CCCTGGGCGC CCCCATCGAC
GGCAAGGGTG AGATCGAGTC CACCGAACGC CGTGAGCTGG AGCTCCAGGC GGCGACCGTG
ATGGAGCGCA AGCCCGTCCA CGAGCCGCTC CAGACCGGTA TCAAGGCGAT CGACTCGATG
ACCCCGGTCG GCCGCGGCCA GCGCCAGCTG GTCATCGGCG ACCGCCAGAC CGGCAAGACC
GCGGTCTGCA TCGACGCGAT CATCAACCAG AAGGCCAACT GGGAGTCGGG CGACCCCGAC
AAGCAGGTGC GCTGCATCTA CGTCGCGATC GGCCAGAAGG GCTCGACCAT CGCCGGTGTG
CGTGGCGCCC TCGAAGAGGC CGGCGCGATG GAGTACACCA CCATCGTCGC CGCCCCGGCG
TCCGAGGCGG CCGGCTTCAA GTACCTGGCC CCCTACACCG GCTCGGCCCT GGGCCAGCAC
TGGATGTACG AGGGCAAGCA CGTCCTCATC GTCTTCGACG ACCTCACCAA GCAGGCCGAG
GCCTACCGTG CGGTGTCGCT GCTGCTGCGC CGCCCGCCGG GCCGCGAGGC CTACCCCGGT
GACGTCTTCT ACCTGCACTC CCGGCTGCTG GAGCGCTGCG CCAAGCTCTC CGACGAGATG
GGCAAGGGGT CGATGACCGC CCTGCCGATC ATCGAGACCA AGGCGGGCGA CGTCTCGGCG
TACATCCCCA CCAACGTCAT CTCCATCACC GACGGCCAGG TCTTCCTGGA GTCGGACCTG
TTCAACCAGG GCCAGCGCCC GGCGATCAAC GTCGGTGTGT CGGTCTCCCG TGTCGGTGGC
GCCGCGCAGA CCAAGGCCAT GAAGAAGGTC TCGGGCACCC TGCGGCTGGG CCTGGCCCAG
TACCGCGAGC TGGAGGCGTT CTCCGCCTTC GGTTCGGACC TGGACGCCGT CTCCAAGCAG
CAGCTGGAGC GCGGTGCCCG CCTGATGGAG CTCCTCAAGC AGGGCCAGTA CTCGCCGTTC
TCCATGGAGA AGCAGGTCGT CTCGATCTGG GCCGGCACCA CCGGCCGCGT CGACGACGTC
CCGGTCGAGG ACGTGCGCCG CTTCGAGGAG GACTTCCTCG ACCACCTGAG CCGCGAGCAC
CAGGGCATCC TCGACACCAT CCGCGAGAGC GGCAAGTTCG AGGACGAGAC CGAGAAGTCC
CTCGACTCGG CGCTGGAGAA GTTCAAGCAG GGCTTCCAGA CCTCCGCCGG GACCCTCCTG
GGCACCGAGG CCGAGGCCGA GGCGCTGGAC GAGGAGAAGG TCGGCCAGGA GACCATCAAG
GTCGCCAAGG GCGGGAAGTA A
 
Protein sequence
MAELTIRPDE IRDALQRFVQ SYEPEATRAE EVGTVTYSGD GIARVGGLPS AMANELLQFE 
DGTLGLAQNL EIGEIGVVVL GDFTSIEEGQ KVRRTGQLLS VPVGDNFLGR VVDPLGAPID
GKGEIESTER RELELQAATV MERKPVHEPL QTGIKAIDSM TPVGRGQRQL VIGDRQTGKT
AVCIDAIINQ KANWESGDPD KQVRCIYVAI GQKGSTIAGV RGALEEAGAM EYTTIVAAPA
SEAAGFKYLA PYTGSALGQH WMYEGKHVLI VFDDLTKQAE AYRAVSLLLR RPPGREAYPG
DVFYLHSRLL ERCAKLSDEM GKGSMTALPI IETKAGDVSA YIPTNVISIT DGQVFLESDL
FNQGQRPAIN VGVSVSRVGG AAQTKAMKKV SGTLRLGLAQ YRELEAFSAF GSDLDAVSKQ
QLERGARLME LLKQGQYSPF SMEKQVVSIW AGTTGRVDDV PVEDVRRFEE DFLDHLSREH
QGILDTIRES GKFEDETEKS LDSALEKFKQ GFQTSAGTLL GTEAEAEALD EEKVGQETIK
VAKGGK
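
As a spot check that the protein sequence corresponds to the gene sequence, the first and last rows of the gene sequence can be translated and compared with the ends of the protein. This is a minimal sketch, not the tool used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G ordering.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row (60 nt) and last row (21 nt) of the gene sequence, copied from the record.
first_row = "ATGGCGGAGC TGACGATCCG GCCGGACGAG ATCCGGGACG CGCTACAGCG CTTCGTCCAG"
last_row = "GTCGCCAAGG GCGGGAAGTA A"

print(translate(first_row))  # expected to match the first 20 residues of the protein
print(translate(last_row))   # expected to match the last 6 residues, then a stop
```

The first row translates to MAELTIRPDEIRDALQRFVQ and the last row to VAKGGK followed by a stop codon, in agreement with the protein sequence listed above.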