Gene Ndas_2205 details


Gene Information

Locus tag: Ndas_2205
Symbol:
ID: 9246055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2635324
End bp: 2636415
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: transport system permease protein
Protein accession: YP_003680133
Protein GI: 297561159
COG category:
COG ID:
TIGRFAM ID:
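
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2635324, 2636415)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both values agree with the Gene Length and Protein Length fields reported in the record.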


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.264091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000202612
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCCGCA CCCTGACACC CCGCCGCCCG CGCGAGCGCG CCCGCACCAC ACCCCGCGGC 
TCCTTCGCCC TGCGCCTGTT CGGCGAACGC CTGTCCCTGC TGGTGCGCCC GCGCACCCTC
GTCGTCGCCG CCGTCCTGAC CGCGCTCAGC GCCGCCGCCC TGATCGTCTC GGTGGCCGTG
GGCGACTACG AGATCCCGCT CGGCGCCGTG CCCGCCGCGA TCGCGGGCTA CGGCGAACGC
CTGGACGTGT TCTTCGTCCA AGGGGTGCGC CTGCCCCGCG CCCTGACCGC CATCGGCGTG
GGCGCCGCCT TAGGGCTCGC CGGAGCCGTC TTCCAGAGCC TGTCGCGCAA CGCCCTGGGC
AGCCCCGACA TCATCGGCTT CACCGGCGGC GCCGCCACCG GAGCCGTCGC CGTCATCCTG
CTCTTCGGCG CCGGACGCCT GGGCGTGTCC CTGGGCGCCA TCGCCGGGGG CATGCTCACC
GCCGCCGCCG TGTACCTGCT CTCCACCAAG AACGGCGTCC AGGGCTACAG GCTGGTCCTG
GTCGGCATCG GCATGGCCGC CATGCTCGGC GCCGTCCGCG ACTACCTGCT CACCCGAGCC
GAACTCACCG ACGCCCTCGG CGCCCAGATC TGGATGATCG GCAGCCTCAA CGGCCGCGGC
TGGGCCGAGG TCGCGGCCGT GTGGATCTGC CTGGTCCTGC TGGGACCGGT CCTGCTCGCC
CTGGGCCAGC GCCTGCGCTT CATGGAACTG GGGGAGGACA CCGCGCGCGG CCTGGGCGTG
CCCACCCGCT CCACCCAGCT GACCGCCCTG GCCGCCGCCA GCGCCCTGAC CGGCGCCGCC
ATCGCCGTCT CGGGCCCCAT CGGCTTCGTC GCCCTGGCCG CACCCCAGCT GGCCCGCCGC
CTGATGCGCA CCGGCGGCAC CACCCTGGCC GGATCCGCGC TCATGGGCGC CGCCCTGCTG
GCGGTGGCCG ACCTGGTCGC GCTGCGCGCC CTGGCCCCCA CCCAGCTGCC CGTGGGCGTG
GTCACCGCCG TCATCGGCGG CAGCTACCTG ATCTGGTTGC TCTACACCGA ATGGCGCGGC
GGACGTGCCT GA
 
Protein sequence
MSRTLTPRRP RERARTTPRG SFALRLFGER LSLLVRPRTL VVAAVLTALS AAALIVSVAV 
GDYEIPLGAV PAAIAGYGER LDVFFVQGVR LPRALTAIGV GAALGLAGAV FQSLSRNALG
SPDIIGFTGG AATGAVAVIL LFGAGRLGVS LGAIAGGMLT AAAVYLLSTK NGVQGYRLVL
VGIGMAAMLG AVRDYLLTRA ELTDALGAQI WMIGSLNGRG WAEVAAVWIC LVLLGPVLLA
LGQRLRFMEL GEDTARGLGV PTRSTQLTAL AAASALTGAA IAVSGPIGFV ALAAPQLARR
LMRTGGTTLA GSALMGAALL AVADLVALRA LAPTQLPVGV VTAVIGGSYL IWLLYTEWRG
GRA
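
The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a short script. This is a minimal sketch rather than the method used by the source database. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, and that whatever codon opens the CDS (here GTG) is read as methionine. The helper names `gc_fraction` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Translation table 11 shares
# these assignments with the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Simplifying assumption: the first codon is always read as M, which is
    how an initiator such as GTG is translated in bacteria.
    """
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# The first 30 bases of the gene sequence listed above.
first_bases = "GTGAGCCGCA CCCTGACACC CCGCCGCCCG"
print(translate_cds(first_bases))  # MSRTLTPRRP
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.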