Gene Ndas_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1247 
Symbol 
ID9245097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1549810 
End bp1550865 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679192 
Protein GI297560218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.232923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAC GGCTGACGAA CCTGGAACCA CGCCTGCACG ACGCGCTGGC GGCGTGGGGC 
GTCCACGCGA CCTCGATCGA CCACGTCCCC CTGGGTTTCG GCGACCACCA CTGGAGCGTC
ACCGACACCG CGGGCCGCCG CTGGTTCACC ACCGTGGCCG ACCTCGCCCG CAAGTCCTTC
CTCGGCCCGG ACCCGGCCGC CGTGCGGCGG CGCCTCACCC GGGCCATGGA CACCGCCGCC
CGGCTGCACG ACGACGAGGG GCTCGGCTTC GTCGTCGCGC CCCTGCGCAC CCCGGGCGGG
GACACCGTCG TCCCGGTCGG CGACGGGTAC GCGCTCAGCG TCTTCCCCCG CGTGGGAGGG
CGAGTCCGGA GACTTCGGCC AGGAGCTGTC CGCCGACCGG CGGGCCCGGC TCCTGGACAC
CCTCGCCCAG CTGCACCGCA GCGCACCGGG CGACGCGCCC GCCGTGGAGA CCCGCCTCCC
CGGCCAGGAC CGGCTCGCCG CGCTGCTGGA CCGCCCCGCC CGCCGTCGGC GACGCGGGGG
CCCCCACGCC GGGCCCACCG CCGACCTGCT CGCCGAGCAC GCCCCCGGCG CTGCGCGAGC
GCCTGGCCGA GTCCGACCGC GGCGCGGCCG CCCTGGAGGA CGCGGCGGCG GTCCTCACCC
ACGGCGAACC CCACCCCGGC AACCTGCTGT GGCGCGGCGA CCGCCCGCTG CTGGTCGACT
GGGACACCGT CGGCCTGGCC GCCCCCGAGC GCGACCTATG GCTGGTCACC GACGACCCCG
CCGAACTGGA ACGCTACGCC GAGGTCAGCG GGCACGAGCC CGACCGCGCA CTGCTGGACC
TGTACCGGCT GCGCTGGGAC CTGCGCGACG TCGTCGAGTT CGTCGACTGG TTCCGCGCGC
CCCACGAGGG AGGCCCCGAC ACCTCCCAGG CCTGGCGGGA CCTGGTCCGC ATCGTCGAAC
GCCTCGGCGC CGGGGAGCGG TCCGGCGCCC GCTGACGCTG CCCGGATCCG GCGGCCCGGC
GGGTCGCGGC CCGCTCGCGC TGTTCGCGGT CGGTAA
 
Protein sequence
MRERLTNLEP RLHDALAAWG VHATSIDHVP LGFGDHHWSV TDTAGRRWFT TVADLARKSF 
LGPDPAAVRR RLTRAMDTAA RLHDDEGLGF VVAPLRTPGG DTVVPVGDGY ALSVFPRVGG
RVRRLRPGAV RRPAGPAPGH PRPAAPQRTG RRARRGDPPP RPGPARRAAG PPRPPSATRG
PPRRAHRRPA RRARPRRCAS AWPSPTAARP PWRTRRRSSP TANPTPATCC GAATARCWST
GTPSAWPPPS ATYGWSPTTP PNWNATPRSA GTSPTAHCWT CTGCAGTCAT SSSSSTGSAR
PTREAPTPPR PGGTWSASSN ASAPGSGPAP ADAARIRRPG GSRPARAVRG R