Gene Ndas_3602 details


Gene Information

Locus tag: Ndas_3602
Symbol:
ID: 9247471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4315807
End bp: 4317075
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: cytochrome bd ubiquinol oxidase subunit I
Protein accession: YP_003681508
Protein GI: 297562534
COG category:
COG ID:
TIGRFAM ID:
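
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4315807, 4317075)
    print(length, "bp")                  # gene length from the record's coordinates
    print(protein_length(length), "aa")  # protein length implied by that gene length
```

With the values from this record, the sketch reproduces the listed gene length of 1269 bp and protein length of 422 aa.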


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.852555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGAAG ACCCCCTCCT GCTGGCGCGG CTCCAGTTCG CGCTGACCGC CGCCACCCAC 
TACATGTTCG TGGCCCTGAC CCTCGGGCTG GCCCCCTACC TCCTCGCCAC CCAGCTGGTC
GCCGCCCTGC GCGGCGACCG CTCCCGGACG ACCGCCGTGC GGTTCTGGGG CGGCCTGTAC
CTGGTCAACT ACGGGATGGG CGTCCTGTCC GGGCTGGTGA TGGAACTCCA GCTCGCGCTG
AACTGGAGCG GCCTGCACGG GATGTTCGGC TACACCTTCG CCGCTCCGCT GGCGCTGGAG
ACGATGACCG CGTTCTTCGT CGAGTCCACG TTCCTGGGTT TGTGGATCTT TGGGTGGGAC
CGCATGGGCC GGTGGGCGCA CCTGGCCTGC TTCGCCGTGG TCACCGCGAC GGCGTACGCC
TCGGCGTGGT GGGTGCTGGT CTCCAACGGC TTCCTGCGCA ACCCCGTGGG CTTCGAGATG
GTCGACGGGG TGGCGCACCT GACCGACCCC GTCGCGCTGA TGACCAACCC GGCGGCCGTG
CTGGCCTTCG GTCACATCGT CACCGGTTCC CTGCTGGTCG GCGCCCTGGT CGTCGCGGCG
ACCAGCGCCT ACCACCTCGT CCGTCGCGAC GACGCGCACG GCGTCTTCCG GCGCGGTATC
CGCCACGCGA CGCTGGTCCT GTGCGTCGTG CCGATACCCG TGGCGGCCTT CGGCGGGGTG
CAGTTCGGGC TGTTCGGCCA GGACCCGCCC ACGAGCGGGC TGACCTACAC CGCCGAGGAG
ATCGCGGCGA TCGAGGCGGC GCACCCGGGC GGCCCCCTCC TGGAGGCGGC CAACACGGCC
GGCGACCTGG TGATGATGAC GTCGTGGGCG CTGGTGTTCC TCCTGGGCCC CCTCATGCTG
CTGGCCTGGC CCCTCGGCGG TCTCGACCGC TGGAGGTGGT TCCTCGCCCC GCTGGTGGTG
ACGCCGTTCC TGCCCTACCT GGCCAGCGTC GGCGGCTGGG TGTTCCGGGA GACCAACCGC
CAGCCGTGGA CGGTCGTGCA CCACCTGACC ACGGCCGACG CGGTGACCCC CCTGTCCCCG
GTCGCGGCCG TGGCCTCCTT CGGTTTCTTC ACAGCCGCCT TCGCGGCCCT GGCCGCCGTC
ACCTACTGGC TCCTGGTGCG CTACGCGCGG CGCGGCCCCG AGGGCGGGCC GCTGGCGGAG
CAGCGCACGC AGCCGCCCGA GGGCCCCGCG GAGCCCGGTG GGTCCGCCGT CCCCGTCCAC
ACGTACTGA
 
Protein sequence
MLEDPLLLAR LQFALTAATH YMFVALTLGL APYLLATQLV AALRGDRSRT TAVRFWGGLY 
LVNYGMGVLS GLVMELQLAL NWSGLHGMFG YTFAAPLALE TMTAFFVEST FLGLWIFGWD
RMGRWAHLAC FAVVTATAYA SAWWVLVSNG FLRNPVGFEM VDGVAHLTDP VALMTNPAAV
LAFGHIVTGS LLVGALVVAA TSAYHLVRRD DAHGVFRRGI RHATLVLCVV PIPVAAFGGV
QFGLFGQDPP TSGLTYTAEE IAAIEAAHPG GPLLEAANTA GDLVMMTSWA LVFLLGPLML
LAWPLGGLDR WRWFLAPLVV TPFLPYLASV GGWVFRETNR QPWTVVHHLT TADAVTPLSP
VAAVASFGFF TAAFAALAAV TYWLLVRYAR RGPEGGPLAE QRTQPPEGPA EPGGSAVPVH
TY
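
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database used: it strips the 10-base block spacing, computes a GC fraction, and translates codons with the standard amino acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled (this gene begins with ATG, so that does not matter here). All names in the code are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; table 11 uses the same amino acid
# assignments as the standard code (it differs in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = ("ATGCTCGAAG ACCCCCTCCT GCTGGCGCGG "
                  "CTCCAGTTCG CGCTGACCGC CGCCACCCAC")
    print(translate(first_line))
    print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the listed protein sequence and GC content to be compared against the nucleotide data.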