Gene Ndas_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1945 
Symbol 
ID9245795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2368778 
End bp2370049 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content75% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionYP_003679878 
Protein GI297560904 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0388376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.012514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCACCT CCTCCTCCGG CCCCCAGGCC AGACCCGCCC GCGTCCTGGG CACCGTCGAC 
GCCGTCGCCA TCGGCGCGGG CGCGATGCTC GGCGCCGGGG TGTTCACCGT CTTCGCCCCG
GCCGCGCTCG GCGCGGGCGC CTGGCTGCCC GTGGCCATCC TCATCGCGGG GATCGTCGCC
TACTGCAACG CCGTCTCCTC GGCCAGGCTC GCCGCCCGCC ACCCCGAGTC CGGCGGCGCC
TACGTCTACG GCCGCGAGCG CCTGGGCGAC CTGTGGGGCT ACCTGGCGGG CTGGTCCTTC
GTGGTCGGCA AGACCGCCTC CTGCGCGGCC ATGGCGCTGA CCTTCGGCGC CTACCTCCTC
CCCGGCTGGG AGAAGGCCCT GGCGGCGGCC ACCGTGGTCG CGCTCACCGC GCTCAACTAC
CGCGGGATCC AGCGCTCCAC GATGGTGATC AAGATCCTGC TGGCCGCGGT CCTGGCCGTC
CTGGCCGCCG TGTGCGTTGC CATGCTCCTC AGCGGGCGCC TGGCCGCCGT CGCCCAGGTG
GAGGGCCTGG GGCCCTGGCC CGGCTCCTTC GGTGTGCTCG GCGCGGCCGG GATGATCTTC
TTCGCCTTCG CCGGGTACGC CAGGGTGGCC ACCCTGGGCG GCGAGGTCCG CGACCCCCAG
ACCACGATCC CCCGCGCGGT CAACATCTCC CTGGGCGGCG TGCTGGCGCT CTACCTCGTC
ATCGGCACCG GACTGCTGAT CGTGCTGGGC CGCGAGCGCC TGTCCACCAC CGACACCCCG
ATGGTCGACG CCGTCCGGGT GGCCGGGTTC GACTGGCTGG CCCCCGTGGT GGTCGCGGGC
GCGGCCGTGG CCAGCCTGGG GGCCCTGCTC TCCCTCATCC TCGGGGTCTC GCGCACCACC
GCCGCGATGG CCGCCGACGG ACACCTGCCC CGCTCCCTGG CCTCGGTCCA CGAGCGCCAC
GGGGTGCCCC ACCGCGCCGA GCTGCTGGTC GGCGCCGTCG TCATCGCCCT GGTCCTGGTC
ACCGACCTGC GCGGCGCGAT CGCCTTCTCC GCCTTCGGAG TGCTGGTCTA CTACGCCATC
ACCAACGCCT CGGCGCTGCG CCTGGGTCCC GACGAGCCCC GCCCGCCGCT GCTGGTGCCG
CTGACCGGGC TGGGCGGCTG CGTCGTGCTG GCGTGCAGCC TGCCGCTGCC CACGGTCATC
ACCGGCATGG CGGTCCTGGC GGTGGGAGCC GGGCTGTGGT GGCTGCGCGA GCGCTTCGAC
CTGCGCTTCT AG
 
Protein sequence
MTTSSSGPQA RPARVLGTVD AVAIGAGAML GAGVFTVFAP AALGAGAWLP VAILIAGIVA 
YCNAVSSARL AARHPESGGA YVYGRERLGD LWGYLAGWSF VVGKTASCAA MALTFGAYLL
PGWEKALAAA TVVALTALNY RGIQRSTMVI KILLAAVLAV LAAVCVAMLL SGRLAAVAQV
EGLGPWPGSF GVLGAAGMIF FAFAGYARVA TLGGEVRDPQ TTIPRAVNIS LGGVLALYLV
IGTGLLIVLG RERLSTTDTP MVDAVRVAGF DWLAPVVVAG AAVASLGALL SLILGVSRTT
AAMAADGHLP RSLASVHERH GVPHRAELLV GAVVIALVLV TDLRGAIAFS AFGVLVYYAI
TNASALRLGP DEPRPPLLVP LTGLGGCVVL ACSLPLPTVI TGMAVLAVGA GLWWLRERFD
LRF