Gene Ndas_3481 details

Gene Information

Locus tag: Ndas_3481
Symbol:
ID: 9247350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4172317
End bp: 4174095
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003681388
Protein GI: 297562414
COG category:
COG ID:
TIGRFAM ID:
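
The coordinate and length fields in this record are related by simple arithmetic, which can be sketched as a quick consistency check. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Start bp and End bp as listed in this record.
    length = gene_length(4172317, 4174095)
    print(length, protein_length(length))
```

With the values listed here, the check gives 1779 bp and 592 aa, which agrees with the Gene Length and Protein Length fields.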


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGACGG CACCCGCCCC CCGCCGCACC GCGGTCGGCA CCACGGTGGC CCGTGCGCGC 
ACGGCACTGA GGCAGCCCGC CGTGGTGCTG GGAGCCGGCA CCCTGCTCGT ACTGTCCGTC
CTCGTGGTGG CGCCGCTCAG CGGGCTGGTC AACACGACCC TCCAGAACGG CAACCGCGAG
GCGTGGGCGG ACGTGTTCGC CAGCCCGATG TCGGAGAACC TCCTGTGGAG GCCGATGGGC
AACTCGATCC TCATGGGGAC CGCCACCGCC GTCTGCTCGA CGGTGGTGGG CGGGTTCCTG
GCCTGGGTGG TCGTGATGAC CCGCATCCCG GGGCGCAGGA CCCTGGGGCT GCTGGCGACG
ATCCCGTTCG CCCTGCCGAG CTTCGCCCTG GCGCTGGCGT GGGAGTCGGT CTTTCGCAAC
GACCTGATCG GCGGGTCCAC CGGGATCCTG ATGAACCTGG GCCTGGACGT CCCGGACTGG
CTGGCCTGGG GACCCGTACC GATCGCCGCG ACCCTGACGG CCCACTACTT CTCGCTGTCG
TTCATGCTGG TCGCCGCGGC GCTGGCCAGC GTCAACGGCG ACCTCATGGA GGCCGCCGAG
ATGACCGGGG CCTCCACGCT GCGGGTGGCC CGGGACATCG CCCTGCCGGT CGTGGCCCCG
GCGATGATCT CCGGCGCGCT GCTCGCCTTC GCCGAGGGGG TCTCCAACTT CGTCTCGCCG
GCCCTGCTCG GCCTGCCGGT GCGGTTCCAC ACGCTCTCCA CCCGGCTGTA CGGGGCGATC
TCCACCGGTG ACGTCGCGCG CGGGTACGTG CTCTCGATCG TGCTCATCGT CGTCGCCGCG
ATGATCATGT ACGCCTCGAC GCGCATCACC GGGGGCGGGC GCAGCTTCGC GACGATCACC
GGCAAGGGCG GCCGCCGCCG GGGCGTGGAC CTGGGCCCCT GGGGGTGGCC GGTGGCCGGG
CTCGCCTGGC TGCTGGTCAC GTGCACCACG GTCGTGCCCG GCCTCGTGCT CGTCCTGAGC
TCGCTGACCG TGCGCACGAA CGACTTCGCC TCCGGGCTGA CCCTGCACTA CTGGATCGGC
GGCTCGGACC CGGCGCTCGC CCAGGGACAG CGGGGCGTCC TGAACAATCC GCAGATCCTG
GAGGCCACCT GGAACACCGT CCTGCTCGGG GTGTGCGTGG CGGTCGGCGC CGGTGTGCTC
GGGCTGCTCA TCTCCTACGT CATCACCCGC TCCAACGGCC CGAGGTGGCT GGTGGGGACC
ATGAGCGTCA CCTCGTTCGT GCCCTTCCTC ATCCCCGGCA TCGCGCTGGG CGCGGCCTTC
ATCGCGCAGT TCGGGGCGCC CATCGGCCCG CTCCCGAGCC TGTACGGGAC GTTCGCGATC
CTCGTGCTGG CCGGGATCGC GGCCACGATC CCGTTCTCCG TCCGGTCGGG CACCTCGGCC
CTGAGCCAGG TGTCCAGGGA CGTCGAGGAG TCCGCGGTGA TGGCGGGGGC CGGGCTGACG
CGCAGGATCG GCGCGGTCAT CGCGCCGCTG ACCGCGCGCG GCCTGTTCAC CGGCGGGGTG
CTGGTGTTCG TCCAGATGGT GCGCGACCTG TCCCTGGTCG TGCTGCTGGC GACCCCGGCG
ATGCCGGTGC TGGCCGTGCT CACCTACCAG TACTCCTCGG AGAACTTCAC CCAGTTGGCC
AACGCGGTCA CCGTGGTCAT CGCGGTGATC TCGGTCGCGG CAACCGTCAT CGCGCGGCGC
TTCGAGGGCG CCGCCCAGCC CTGGAACTCC CGATCATGA
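
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be joined before any calculation such as GC content. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence length; the helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in this record.
    first_line = (
        "ATGACGACGG CACCCGCCCC CCGCCGCACC GCGGTCGGCA CCACGGTGGC CCGTGCGCGC"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same two functions over all lines of the gene sequence would give a value to compare against the GC content field.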
 
Protein sequence
MTTAPAPRRT AVGTTVARAR TALRQPAVVL GAGTLLVLSV LVVAPLSGLV NTTLQNGNRE 
AWADVFASPM SENLLWRPMG NSILMGTATA VCSTVVGGFL AWVVVMTRIP GRRTLGLLAT
IPFALPSFAL ALAWESVFRN DLIGGSTGIL MNLGLDVPDW LAWGPVPIAA TLTAHYFSLS
FMLVAAALAS VNGDLMEAAE MTGASTLRVA RDIALPVVAP AMISGALLAF AEGVSNFVSP
ALLGLPVRFH TLSTRLYGAI STGDVARGYV LSIVLIVVAA MIMYASTRIT GGGRSFATIT
GKGGRRRGVD LGPWGWPVAG LAWLLVTCTT VVPGLVLVLS SLTVRTNDFA SGLTLHYWIG
GSDPALAQGQ RGVLNNPQIL EATWNTVLLG VCVAVGAGVL GLLISYVITR SNGPRWLVGT
MSVTSFVPFL IPGIALGAAF IAQFGAPIGP LPSLYGTFAI LVLAGIAATI PFSVRSGTSA
LSQVSRDVEE SAVMAGAGLT RRIGAVIAPL TARGLFTGGV LVFVQMVRDL SLVVLLATPA
MPVLAVLTYQ YSSENFTQLA NAVTVVIAVI SVAATVIARR FEGAAQPWNS RS
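
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the sequence is given in the coding orientation and begins with an ATG start codon, as it does in this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted, so alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGACGACGG CACCCGCCCC CCGCCGCACC"))
```

The first 30 bases translate to MTTAPAPRRT, the first block of the protein sequence, and the final nine bases (CGATCATGA) give RS followed by a stop codon, matching its last two residues.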