Gene Ndas_0804 details

Gene Information

Locus tag: Ndas_0804
Symbol:
ID: 9244649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 992383
End bp: 993432
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003678754
Protein GI: 297559780
COG category:
COG ID:
TIGRFAM ID:
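
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; both are assumptions about this record's conventions, although they agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 992383
end_bp = 993432

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```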


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.633218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGCCCG ACGACCTGAG CCGAGTGGAC GAATTCGCCG ACTTCTGGGC TCCCGACCCT 
CCCGAACTCA GCTGGCCGGA CCGCGCCCGG GCCCTGGCGG GGTTCCTCGC CGCCGCGGCG
CGCCCGGACC GCGCGCGTGT GGCCGCGCTG GCGGTGGCCG TGCTGGGGAG CTTCCTGTTC
GCGCCCCAGG GCACGGCGGT GGCGGCGCCG GTGCCCGCCG AGCCGGAGGA CATGGGCGCC
CTCCAGGATC GCGCGGAGGC CCTGAGCGAG GAGTTCAACG GCGAACTGCG CGACATGGAG
GGCGTCATCC AGGAGGCCGA GCGCGCCGAG GAACGGGCCC AGAGCACCCG CGAGGACGTG
GAGGAGGCGC GCGAGCAGGT GCGCGCCCTG GCGGTGGCCA CCTACACCAG CAGCGGGATC
GACCTGTCCA TGTCGCTGTT CGTCGAGGCC GACCCCGACG AGGTCATCGA CCGCGCGGTG
GTGATCAACT ACCTGTCCAC CAGCAACCAG GACAAGATCG ACCAGCTCAG TGAGGCCCTG
GAGCGCGACG AGACCGCGCA GCAGAACGCC GAGGAGCAGC TGGCCGCGGC CGAGGAGGAC
CTGGACGAGC TGGAGGGGCG CCGCGAGGAG GTCCAGGAGA TGATCGCGGA CCACCCCGTG
CAGCCGATGG GCGGCCAGTA CAACATCACC CCGCGCACCG AGCAGATGCG CGAGCTGATC
ATCGAGAAGT TCGGCGAGGG CACAGACGTG GGCGGCGTGG GCTGCTACCG GGAGGTCGGC
GGCTGGGTGG TCGGCGAGCA CCCCAAGGGC CGCGCCTGCG ACTTCATGGT GGACCCCAAC
GGGAACACGC CCTCACAGGA GCAGATCGAC CGCGGCTACG CGATCGCCGA GTGGGCCCAG
GAGAACGCCG ACCGCCTCGG CATCATGTAC ATCATCTACC GGCAGCAGAT CTGGGACATC
CGCCGTGGTG ACGAGGGCTG GCGCGACATG GCCGACCGCG GCAGCATCAC CGAGAACCAC
TTCGACCACG TGCACATCTC GATGTTCTGA
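
The GC content reported in the Gene Information fields can in principle be recomputed from a sequence in this block layout. The following is a minimal sketch; the helper name `gc_percent` is hypothetical, and only the first row of the gene sequence is pasted in for brevity, so the printed figure describes that row rather than the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "GTGCAGCCCG ACGACCTGAG CCGAGTGGAC GAATTCGCCG ACTTCTGGGC TCCCGACCCT"
print(round(gc_percent(first_row), 1))
```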
 
Protein sequence
MQPDDLSRVD EFADFWAPDP PELSWPDRAR ALAGFLAAAA RPDRARVAAL AVAVLGSFLF 
APQGTAVAAP VPAEPEDMGA LQDRAEALSE EFNGELRDME GVIQEAERAE ERAQSTREDV
EEAREQVRAL AVATYTSSGI DLSMSLFVEA DPDEVIDRAV VINYLSTSNQ DKIDQLSEAL
ERDETAQQNA EEQLAAAEED LDELEGRREE VQEMIADHPV QPMGGQYNIT PRTEQMRELI
IEKFGEGTDV GGVGCYREVG GWVVGEHPKG RACDFMVDPN GNTPSQEQID RGYAIAEWAQ
ENADRLGIMY IIYRQQIWDI RRGDEGWRDM ADRGSITENH FDHVHISMF
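
The gene begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG is one of several alternative start codons and an initiating codon is read as methionine. The sketch below illustrates that with a hand-built codon table rather than a bioinformatics library. The function name `translate_cds` is hypothetical, and the set of start codons is the one listed for table 11 in the NCBI genetic code tables as far as I recall, so treat it as an assumption.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-residue mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for table 11; a start codon in first position encodes Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCAGCCCG ACGACCTGAG CCGAGTGGAC"))  # MQPDDLSRVD
```

The output matches the first ten residues of the protein sequence above. Passing the full 1050 bp gene sequence should reproduce all 349 residues if the record is internally consistent.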