Gene Ndas_2693 details

Gene Information

Locus tag: Ndas_2693
Symbol:
ID: 9246544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3209058
End bp: 3210707
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003680614
Protein GI: 297561640
COG category:
COG ID:
TIGRFAM ID:
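
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; both assumptions agree with the values in this record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3209058, 3210707)
print(length, protein_length(length))  # 1650 549
```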


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.613106
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.221929
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCCC AACCTCCCGG CCCCCTGGAG CAGGGCGAGG TCCTCCAGGA CCTCGCCGGG 
CGACTGGTCG CGGTCGTCCC GGAGGGCTGG CAGCAGCTCA CCTACCTGGC CAGGGTGATC
GGCGCCCACC GCAGCGACAT GCTCGCCGTC CAGGAGGCGG ACGGCCGGGT CCGGCAGCTG
GTGGTGCCCG GCGGTGTCGG CGACCTGGTC GACGCGCTCA AGCGGTCCGG GTTCCGGGAG
GGCGGGGGCA CCTGGCTCTC CATGGTCCTC TCGGTCCACC ACACGCACCA GTTCAACGTC
GAGTACAACC ACGACACCGA GCCCGACCTG CCGCCCGGGA CGAGCCCGCT CGTCTACGCC
CAGGAACTGG AGCGCTTTCC GCGCGCGCAC GACCGGATCC CCGACTGGCT CGGCGCCCGG
CTGGAGCAGG CGCGTGAACT GGATCCCGAG CGGATGCGCG AGGAGGTCGG CGCGGCCCTG
GTGCGGGCCT GCGAGCGGGA GGGGCTGCGC GCCGACTTCC TGCCGCCGAC CCGCCTGCGC
GTGTTCGACT TCGGGGGCGC GGTCCTCATG GAGGCCGACA TGAGGGAGAC CTTCGACCAG
GCGGTCATCG CCGCGGAGGA GCAGCGGACC GACCTGGCCG CCCGCTTCGC GGACTTCATG
GCCGATGCCG CTCGGGAGCG GGAGCAGGCC GCCGACGGGT CCTCCGCCGA GGCGTCCTCC
ACCGGTACGC GGGACGCCGA CCCGGCCGGC GCTCCCTCAC CGCCGGACCC CGACGACACG
GTGGCCGTCT CCCTGGCGGC GGCCTTCGCC GAGGCCGGGG TGGGCGCCGC CTTCCAGGGC
GCCGACACGC TCGTCGTGAC GCTGCCCGAC GGCAACCACG CCAGCGCCGA CATCAGCGGA
CTGCGCGCCG CACTGGGCGA GGCCACCCCC GAGCAGATCG CGCACAACAC CGCCCAGTTC
GCGCGCACGT CCGTCGAACA GCTGAGCCAG GCCACCGGAC AGGGAGGCGG CGACACCGAC
GGACGGCTCC GGGTACGCCT GTACCCCGCC TCCGCCTTCC CCGAGGGCGT ACTGGACTCC
CTCCTGACCC GCGAGATCGC CCCCGGGCTG TGGCAGACCG TGGTCGTGGA CGCCTCCGAC
TCGCTGAGGC CGCTGCCCCG CCAGGTGCAC GAACGCTCCG GCCGCCCCGA CGGCGAGGTC
TTCGCCGAGG CGGTGGCCGC CTCCGTGGCC GAGGCCGTCG AGGTCAGCGA GCACGAGGTC
GACGGGGCGC GCATCGTGCA CATCGGCGGC CAGCACCCCT ACGTGGCGGC GCACGCGCAC
GACCTCGACC GCCACCTCGG CGACCTGCCG CACGGCGCGC TCGTGGCCTT CCCCGTTCCC
GAGGTGCTCC TGGCCCACCC CCTGGGGAAG GGCCACCCGA TCGCCGCCCT GGACCACATG
CAGCAGGTCG CCGAGCGGTT CACCGCCGAC GGCGACAAGC CCGTCAGCGC CCAGCTCTAC
TGGTGGCACC CCGGTTCGCG CTCGCGCGAT CGGGGCACGC CGCCCGACCT GCGTCCCGTG
GGGGCCAGGA TCGACCACGA GAACAGGTCG GTGGAGCTGC TGACCTCCGA CGAGGAGTTC
GGGCCCATGC TCGCCTCCCT GGTCGGGTAG
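
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helper below strips the whitespace and computes the G+C fraction, which is how a GC content figure such as the one in this record is conventionally obtained. It is shown here on the first row only; pasting the full sequence works the same way.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First row of the gene sequence, copied as printed (spaces included).
first_row = "ATGACTCCCC AACCTCCCGG CCCCCTGGAG CAGGGCGAGG TCCTCCAGGA CCTCGCCGGG"
print(f"{gc_content(first_row):.0%}")
```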
 
Protein sequence
MTPQPPGPLE QGEVLQDLAG RLVAVVPEGW QQLTYLARVI GAHRSDMLAV QEADGRVRQL 
VVPGGVGDLV DALKRSGFRE GGGTWLSMVL SVHHTHQFNV EYNHDTEPDL PPGTSPLVYA
QELERFPRAH DRIPDWLGAR LEQARELDPE RMREEVGAAL VRACEREGLR ADFLPPTRLR
VFDFGGAVLM EADMRETFDQ AVIAAEEQRT DLAARFADFM ADAAREREQA ADGSSAEASS
TGTRDADPAG APSPPDPDDT VAVSLAAAFA EAGVGAAFQG ADTLVVTLPD GNHASADISG
LRAALGEATP EQIAHNTAQF ARTSVEQLSQ ATGQGGGDTD GRLRVRLYPA SAFPEGVLDS
LLTREIAPGL WQTVVVDASD SLRPLPRQVH ERSGRPDGEV FAEAVAASVA EAVEVSEHEV
DGARIVHIGG QHPYVAAHAH DLDRHLGDLP HGALVAFPVP EVLLAHPLGK GHPIAALDHM
QQVAERFTAD GDKPVSAQLY WWHPGSRSRD RGTPPDLRPV GARIDHENRS VELLTSDEEF
GPMLASLVG
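
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it does not special-case alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G or T raises KeyError.
    """
    cleaned = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied as printed.
print(translate("ATGACTCCCC AACCTCCCGG CCCCCTGGAG"))  # MTPQPPGPLE
```

The output matches the first ten residues of the protein sequence above; the final row of the gene sequence likewise translates to the last nine residues followed by the stop codon TAG.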