Gene Ndas_2439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2439 
Symbol 
ID9246289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2892809 
End bp2893870 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content77% 
IMG OID 
Producttransport system permease protein 
Protein accessionYP_003680365 
Protein GI297561391 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.107589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGC GCTCCGGGGC CCTGGGTCCC CGAACGGGGG TGGCCCGCCT GGGCGGACTG 
TCGGTACGGG TGTACTGGCC CGCGGTACTG CTCGGCGGCC TGCTCACCGC CCTGGCCGCG
GCGGTGGCCC TGGTCTCGCT GACCCTGGGG GACTTCGAGC TGGGCGTGAG CGAGGTGGTG
GACGCGCTCA CCGGCCGTGC CGGGGTGATG GTGACGCACG TGGTGGTGGA GATGCGGCTG
CCGCGTGTGC TCACCGCGCT GGGTGTGGGC GCCGCACTGG CGCTCTCGGG GGCGCTGCTG
CAACGGCTGG CGCACAACCC GCTGGTCAGC CCGGACGTCA TCGGGGTCAG CGCGGGCGCG
ACGACCGCGG CGGTGCTCGC CATCGTCGTC TTCGGCGGCA CGGCGGCGGC GATCGCGGCC
AGCGCGCTGG CGGGGGCCGT GGCCACCGCG TTCCTGCTGT ACCTGCTCGC CTACCGGCGC
GGTGTCAGCG GGCAGCGGCT GGTCCTGGTG GGGATCGCGG TCACCGCGGT GCTGGGCGCG
GTGACGTCGT ACCTGCTCAC CCGCACGGAG CTCGCCACGG CGCAGCGCGC CATGCTCTGG
CTCACCGGCA GCCTGGCCAA CCGGGACTGG CCGCACGTGG TGACGGTGGC GGTGGGGTTG
GCCGTCCTGG CTCCGACCAC GTTCCTGTCG GCCCGACCGC TGTCCCTGCT CCAGCTCGGG
GAGGACGCGG CGACCGCCCT GGGCGGCCGG GTGCGGCTCG CCCGGGGCGC CCTGCTGTTC
ACCTCCGCCG CGCTCGCGGC CACGGCCACC GCCGTCGCCG GTCCCGTCGC GTTCGTCGCC
CTGGTGGCCC CGCAGATCGT GCGGCGGCTG CTGGGCGGAC GCGCCCTCGG GCTGCTGCCC
TGCGCCGCCT GCGGAGCGCT CCTGACGGCC GTTGCGGACC TGGTCGCGCG CACCGCCTTC
GGGGGGAGCG AACTGCCGGT CGGGGTGGTC ACCGGGGCGC TGGGCGCCCC CTTCCTGCTG
TACCTGCTGG CCCGCGGCGG CAGGGCGGGA CGGGACCGGT GA
 
Protein sequence
MTARSGALGP RTGVARLGGL SVRVYWPAVL LGGLLTALAA AVALVSLTLG DFELGVSEVV 
DALTGRAGVM VTHVVVEMRL PRVLTALGVG AALALSGALL QRLAHNPLVS PDVIGVSAGA
TTAAVLAIVV FGGTAAAIAA SALAGAVATA FLLYLLAYRR GVSGQRLVLV GIAVTAVLGA
VTSYLLTRTE LATAQRAMLW LTGSLANRDW PHVVTVAVGL AVLAPTTFLS ARPLSLLQLG
EDAATALGGR VRLARGALLF TSAALAATAT AVAGPVAFVA LVAPQIVRRL LGGRALGLLP
CAACGALLTA VADLVARTAF GGSELPVGVV TGALGAPFLL YLLARGGRAG RDR