Gene Ndas_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4000 
Symbol 
ID9247872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4783872 
End bp4784828 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content70% 
IMG OID 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_003681903 
Protein GI297562929 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.224944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.135837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTG TGACCGCGCC CGCGCGCGAG GCGCCCGCGG CGGCGACGCC GCCCGAGCGG 
CCGAGGCGGA GCCGCGGCGG GGGCGGATGG GCCCGCCGGG CCCCGCTCCT GCCCGCCCTG
GTGTTCATGC TGGTCGTCAC GCAGCTGCCG TTCCTGGCGA CGGTCGTGTA CTCGCTGCGC
TCGTGGAACC TGCTGCGGCC CGACTCCCAG GCGTGGGTGG GCCTGGCCAA CTACGCGGCG
GTCTTCACCG ACCGGCAGTT CCTGGGCGCG GCGGCGAACA CGGTGCTGAT CACCGCCTCC
TGCGTGGTGG TGGCGATGCT GCTGGGCATC GGGCTCGCGC TGCTGCTGGA CCGGAAGTTC
AGAGGGCGAG GCGTGGTGCG CACGCTCGTC ATCACCCCGT TCCTGATCCT GCCCGTGGCC
ACGGCGCTGC TGTGGAAGCA CATCATGCTG GAGCCGGTGT TCGGCCTGGT GAACTTCGTG
CTGTCGCCGT TCGGGGTCGA GTCGTTCGAC TGGGTCTCCC AGGCGCCCGT GTTCTCCGTG
GTGGCCGCGC TGGTCTGGCA GTGGACGCCG TTCATGATGC TGCTGGTGCT GGCGGGACTC
CAGAGCCAGG GCTCCGACGT ACTGGAGGCG GGCCAGGTCG ACGGCGCGTC CCGGTGGCAG
ACGTTCGCCT GGATCACCCT GCCGCACCTG CGCCGCTACA TCGAGCTGGG CGTGCTGCTG
GGGTCGGTCT ACGTGGTCAA CACCTTCGAC ACGATCTACA TGATGACCCA GGGCGGGCCG
GGCACCGCCA GCTCCAACCT GCCCTTCTAC GTCTACCAGC GGACCTTCCT CGGGTTCGAC
ATCGGCCAGT CCGCGGCCAT GGGCGTCGTG GTGGTGGTCG GCACCATCAT CGTGGCGACG
CTGGCCCTGC GGCTGATCTT CCGCACGTTC ATGAACGCAC AGGAGGCGAC ATCGTGA
 
Protein sequence
MSAVTAPARE APAAATPPER PRRSRGGGGW ARRAPLLPAL VFMLVVTQLP FLATVVYSLR 
SWNLLRPDSQ AWVGLANYAA VFTDRQFLGA AANTVLITAS CVVVAMLLGI GLALLLDRKF
RGRGVVRTLV ITPFLILPVA TALLWKHIML EPVFGLVNFV LSPFGVESFD WVSQAPVFSV
VAALVWQWTP FMMLLVLAGL QSQGSDVLEA GQVDGASRWQ TFAWITLPHL RRYIELGVLL
GSVYVVNTFD TIYMMTQGGP GTASSNLPFY VYQRTFLGFD IGQSAAMGVV VVVGTIIVAT
LALRLIFRTF MNAQEATS