Gene Ndas_2971 details


Gene Information

Locus tag: Ndas_2971
Symbol:
ID: 9246824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3545497
End bp: 3546609
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: cytochrome oxidase assembly
Protein accession: YP_003680887
Protein GI: 297561913
COG category:
COG ID:
TIGRFAM ID:
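
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The variable and function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3545497
end_bp = 3546609
gene_length_bp = 1113
protein_length_aa = 370


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def coding_length_aa(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_length_aa(computed_length)

# Both checks agree with the values listed in the record.
assert computed_length == gene_length_bp
assert computed_protein == protein_length_aa
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.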


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATT CCCCGGCCTC CGCGGTGCTG GCCGCGAACT CGGTCCCCCC GCTCGCGGCG 
GAGGACATCT CCCTTCTCGG ATTCCCCCTG TGGGGCTGGC AGATCGCCGT CGCGACCGCG
GGAGTACTCG CTCTGGTGCT GCTCGCGCGC ACGATCTGGA CACCCACGCA GCGGTCTCTG
CGGGCCTGGG CGCTCGGCAA CATCGTCGTC AACGCCGGAA TCGCCGTCAC CGGCGCCACC
GTGCGCGTCA CCTCCTCGGG CCTGGGCTGC TCGGAGTGGC CCAAGTGCAC GCCGGAGAGC
TTCGTGCCCA TCGACACCGG GCACGCCGCG CTCAACGCGG CCATCGAGTT CGGCAACCGG
ACGCTCACCT TCGTCGTGCT GGCGGTCGCC GTCATCACCT TCGTCGCGGT CATGCGGATG
AGCCCGCGCC GCCCCGACCT GGTCCGGCTG GCCGTCATCG TGCCGTTCGG CGTCCTGGGC
CAGGGCGTGG TCGGCGGCAT CACCGTGTGG ACCGACCTGC ACCCCGCCTC GGTGGCCGCG
CACTTCCTGC TGTCCATGGT GATGGTCTTC ATCACGGTCG CCCTGTACGT GCGCTGCCAG
GAGCCCGAGG GCAAGCCCCA GGTGTCCGCG GGGCCCATGC TGCACGCCCT GAGCGTCGGA
CTGGTCGTGG TCGGCTTCGT CCTGCTGGTC GCCGGGACCG TCGTCACCGG CACCGGCCCG
CACGGCGGCG ACGCCGCGGC CCCCCGCTGG GGGTTCGACC TGGCGGCGGT CACCCGCCTG
CACTCCGCGC TGGCCTGGCT CACCACGGCC GGGGCCGTGC TGGCCACGGT CATCGCCTTC
CGCGGCGGCG CCCGGCGGCC GGTCCGCATG AGCAGCGTGT TCCTGCTGGC CACCGTCGTC
CTCCAGGGCG TCGTGGGCTA CACCCAGTAC GCCCTGGAAC TGCCCGAGGC CCTGGTGGTC
CTGCACGTCC TGGGCTCGGC CCTGACCTGG GTCGCGATCT CCCGTCTGTA CTTCTCCACC
ACGCGCCTGG TCCCGCCGGG TCAGGAGCTC AGCGAGGACC CTGCTGACCG TGACGGGGAA
CGTAGTCCTG CTGTGCCCGC GCCTGGGCGA TGA
 
Protein sequence
MSDSPASAVL AANSVPPLAA EDISLLGFPL WGWQIAVATA GVLALVLLAR TIWTPTQRSL 
RAWALGNIVV NAGIAVTGAT VRVTSSGLGC SEWPKCTPES FVPIDTGHAA LNAAIEFGNR
TLTFVVLAVA VITFVAVMRM SPRRPDLVRL AVIVPFGVLG QGVVGGITVW TDLHPASVAA
HFLLSMVMVF ITVALYVRCQ EPEGKPQVSA GPMLHALSVG LVVVGFVLLV AGTVVTGTGP
HGGDAAAPRW GFDLAAVTRL HSALAWLTTA GAVLATVIAF RGGARRPVRM SSVFLLATVV
LQGVVGYTQY ALELPEALVV LHVLGSALTW VAISRLYFST TRLVPPGQEL SEDPADRDGE
RSPAVPAPGR