Gene Ndas_3975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3975 
Symbol 
ID9247846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4754391 
End bp4755515 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681878 
Protein GI297562904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.125368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAGGG GACCGGTACG GGCACTGGCG ACGACGGGGG CACTGACCCT CGTCCTGACA 
CTGGTCGGAG CGGGGGACGC CCCGGCGGGC GGGGCCCCGT CCTCTCCACG GCAACCCTCG
GACCGGCCCG CGGGGGTGAC GGAGCCGCCC TGGCGCTGGG GGTACGCCAC CGACGAGGAG
TGCACCGTCT CGGAGATCCT GGAGCCCTCG TGCGGCGTCT GGTGGGGGGC CAGCCCCTAC
CAGGACCGGA TCGAGCCCCT GGAGGAGGCG GTCGACCGCC GCATGGACAT CGTCTACACC
TGGCGCGGCG TCGACCAGGC CAACATCCCC GGCAGGCGCG AGCGGGAGCT GATCGCCGAG
GGCAGGTTCG TGCACACCAA CATCGAGGCC CGGCGGTTCA CGCGCTCCGG GCATCCGGCG
GTCTCCTACG AGTCCCTCAT CGACGGCGAG TTCGACGACT CGCTGCGCTC CCAGGCCCGC
GCCGTCGCGG AACTGGACGT GCCCTACTTC GTCACCTTCG ACCACGAGGC CGACGCCAAC
ACGCGCTACA ACAGGCGCGG CACGCCCGAG GAGTTCGTGC GGGCCTGGCG GCACATCGTG
GACCTGTACC GCGAGGAGGG CGCGGACAAC GCCATCTGGG TGTGGAACGT GACCGGCTGG
GAGGGCAACT TCGACCGCCT CCCCGGCCTG TGGCCCGGCA ACGACTACGT CGACTGGGTC
AGCTGGGAGG CGTACAACAT GACCGGCTGC GACTCCCAGC CGCACTGGGA CGAGGTGTAC
TCCTTCGAGG ACATGATGCG CCCGGCCTAC GAGTGGTTCC AGAACGAGGG GCCCGACCAC
GGGATCGACC CGGACAAGCC GGTGATGATC GGGGAGATGG GCACCACGCC CATCGGCTCG
CAGGAGACCC TGGAGTGGTA CTCCGAGATC CCCGACGTGC TGCGCCGCTA CGAGCGGGTG
CGCGCGGTCA AGGTGTGGGA CAACAAGCTG TCCCCGGACT GCGACTTCCG GATCCGGGCC
AACGAGTACG CCCAGCGCGG CTTCGAGGCC GCCGGACAGG ACCCGTACGT GTACCTGCCC
GAGCGGGTGC GCCGCCTGGC CGAGTACGCC CAGCAACGGG GTTGA
 
Protein sequence
MDRGPVRALA TTGALTLVLT LVGAGDAPAG GAPSSPRQPS DRPAGVTEPP WRWGYATDEE 
CTVSEILEPS CGVWWGASPY QDRIEPLEEA VDRRMDIVYT WRGVDQANIP GRRERELIAE
GRFVHTNIEA RRFTRSGHPA VSYESLIDGE FDDSLRSQAR AVAELDVPYF VTFDHEADAN
TRYNRRGTPE EFVRAWRHIV DLYREEGADN AIWVWNVTGW EGNFDRLPGL WPGNDYVDWV
SWEAYNMTGC DSQPHWDEVY SFEDMMRPAY EWFQNEGPDH GIDPDKPVMI GEMGTTPIGS
QETLEWYSEI PDVLRRYERV RAVKVWDNKL SPDCDFRIRA NEYAQRGFEA AGQDPYVYLP
ERVRRLAEYA QQRG