Gene Ndas_1783 details


Gene Information

Locus tag: Ndas_1783
Symbol:
ID: 9245633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2182000
End bp: 2183076
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003679717
Protein GI: 297560743
COG category:
COG ID:
TIGRFAM ID:
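
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 2182000
end_bp = 2183076

# Both endpoints count, hence the +1.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1077 0 358
```

Under these assumptions the result agrees with the listed gene length (1077 bp) and protein length (358 aa).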


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.203269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.758933
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGGGG CGGCCGTGGA CTGGTACCCG CTGAGGGTGA GCGCCCCGGC CCGACAGCTG 
GTGTTCGGCG GCCACGCCAT CGCCCGCCAC CTGGGCCGGG AGGGGCTGCC CGACTGGGCG
GTCGCGGAGA CCTGGGAGGT CAGCGACGTC GACGGGAACG GCAGCACGGT ACTCGACGGG
CCCCTCGCGG GCCGGTCCCT GCGCGAGCTC GTCGCGCGGT GGCCGGAGGG GTTGGTGGGC
GAGGACTGGT CCGGGGAGGT CTTCCCGGTG CTGACCAAGT TCATCGACGC CTCCGGAACG
CTGCCGGTGC ACCTGCACGC CGACGACGCC ACCGCCCGGC GGCTGGAGGG ACAGCCCAAC
GGCAAGACCG AGGCCTGGCA CATCCTCGAC GCGCCTCCCG GCGCCACCGC GCTGTGCGGG
GTCAGGAGCG GGGTGACCGG GGAGCGGCTC CACCAGGCAC TGCTCGACCA GGACTTCGAC
GCCGTGCTGC GCCGCCTGCC GGTGCGGCCC GGCGAGACGG TCTACGTCCC GGGCGGCACC
GTGCACAGTT TCGGCCCCCG GACCCTGGTC TACGAGATCG AGCAGACCTC CGACGTCCAG
CAGCACGCGA TGCGCTGGGA GATGGAGGAC GGCTCACCGG TCCCGGACGA GCGGTGGCGC
GCGAACCTGG AGGCGCTGAT GGCCCAGGTC CGGCCGGAGC ACAGGCCCGA CTTCCACCCG
GGGCTGAGGA TCGGGGTCGG CGACGGCGTG GAGCGGGTGT TCTGCTGCGC CGGACCGCAC
TTCGCGCTCG AACGCTGGCA CGCGGGCACC GCCGAGCCCC TGCGCCACAC GTTCGCCACC
GCGCAGGTCC TCACCAACGT CGGGGCGCCC GTCCGGGTGC GCTGCGGCGA CTGGCGTGGT
GAGCTGGGCC GGGCCCGGAC GCTGCTGCTG CCCGCCGCGT TGGGCGAGGT GGAGATCGCG
GGCCCGGCCG ACGTGCTGTT CGGCTACCTG CCCGACCTGG ACCGCGACGT GGTCGCCCCC
CTGGCCGCCG CCGGTTACCC CCGTGAGGCC GTCGCCTTCC TCGGCGAGGG CCTGTGA
 
Protein sequence
MTGAAVDWYP LRVSAPARQL VFGGHAIARH LGREGLPDWA VAETWEVSDV DGNGSTVLDG 
PLAGRSLREL VARWPEGLVG EDWSGEVFPV LTKFIDASGT LPVHLHADDA TARRLEGQPN
GKTEAWHILD APPGATALCG VRSGVTGERL HQALLDQDFD AVLRRLPVRP GETVYVPGGT
VHSFGPRTLV YEIEQTSDVQ QHAMRWEMED GSPVPDERWR ANLEALMAQV RPEHRPDFHP
GLRIGVGDGV ERVFCCAGPH FALERWHAGT AEPLRHTFAT AQVLTNVGAP VRVRCGDWRG
ELGRARTLLL PAALGEVEIA GPADVLFGYL PDLDRDVVAP LAAAGYPREA VAFLGEGL
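
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a minimal sketch, not the method used to generate this record. It assumes the elongation codon assignments of NCBI translation table 11 (the same as the standard code), and treats a leading ATG, GTG, or TTG as methionine, which is only a subset of the start codons that table allows. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 start codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as Met and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "GTGACGGGGG CGGCCGTGGA CTGGTACCCG"
print(translate(first_blocks))  # MTGAAVDWYP
```

The gene begins with GTG rather than ATG, which is why the start codon needs special handling: read as an internal codon GTG would give valine, while the listed protein begins with M. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.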