Gene Ndas_2587 details

Gene Information

Locus tag: Ndas_2587
Symbol:
ID: 9246438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3080754
End bp: 3081779
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003680511
Protein GI: 297561537
COG category:
COG ID:
TIGRFAM ID:
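
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3080754
END_BP = 3081779


def gene_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1026 341"
```

Both results agree with the Gene Length and Protein Length fields in the record.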


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.969826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0068543
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGTGC CGATGGACGC GCCCGAGTCC TCCCAGCACG CCGACCCCGA GGCGGCGGCC 
GCCGGAGGAC GCGGGTCGGC GAACAGGTCC CTGCGCCAGA TCGCCTGGCG GCGCTTCCGC
AGGGACCGCC TCGGCATGGC CGGGGGCGTC GTCGTCATCC TGCTCATCCT GGTCGCGGTC
TTCGCGCCCC TGCTCACCTC CTGGTTCGGC TACCCGCCCA ACCAGTTCAA CCAGGAGCTG
ATCGACCCGC TCACCGGCGG CGTCCTGCGC GACCCCGCCG ACCCCTCCCT GGGGCTCGAC
CCCTGGGGCG GTATCAGCGC CGACCACCTG CTCGGCGTGG AACCCGTCAA CGGGCGTGAC
CTGTTCAGCC GCATCGTCCA CGGCGCCCGC ACCTCCCTGC TGGTCGCCAC GGTCGCCACC
CTGGTCTGCG TGGTCATCGG CACCGTCCTG GGCATGGTCG CCGGGTACTT CGGCGGCTGG
GTCGACACCG TCATCAGCCG GGCCATGGAC ATCTTCCTGG CCTTCCCGCT GCTGCTCTTC
GCCATCGCCC TGGTCGGCGT CATCCCCGAC GGCTCCTTCG GCCTGAGCGG CAACGGCCTG
CGCATCGGCG TGCTGGTCTT CATCATCGGG TTCTTCAACT GGCCCTACAT CGGCCGCATC
GTGCGCGGAC AGACCCTGAC CCTCCGGGAG CGCGAGTTCG TGGAGGCCTC CCGCAGCCTC
GGCGCGGGCA GCGCCCACAT CGTCTTCCGC GAGATCCTGC CCAACCTCGT CACGCCGATC
CTGGTCTACT CCACGCTGCT CATCCCCACC AACATCCTGT TCGAGGCGGC CCTGAGCTTC
CTGGGCGTGG GCATCAACCC GCCCATGGCC ACCTGGGGCG GCATGCTCGA CAACGCCCTG
CGCTTCTACA CCGTCGCACC GCACTTCGTG CTCATCCCGG GGCTGGCCAT CTTCGTCACC
GTCCTGGCCT TCAACCTCTT CGGCGACGGG CTGCGCGACG CCTTCGACCC CCGCTCCTCC
GACTGA
 
Protein sequence
MSVPMDAPES SQHADPEAAA AGGRGSANRS LRQIAWRRFR RDRLGMAGGV VVILLILVAV 
FAPLLTSWFG YPPNQFNQEL IDPLTGGVLR DPADPSLGLD PWGGISADHL LGVEPVNGRD
LFSRIVHGAR TSLLVATVAT LVCVVIGTVL GMVAGYFGGW VDTVISRAMD IFLAFPLLLF
AIALVGVIPD GSFGLSGNGL RIGVLVFIIG FFNWPYIGRI VRGQTLTLRE REFVEASRSL
GAGSAHIVFR EILPNLVTPI LVYSTLLIPT NILFEAALSF LGVGINPPMA TWGGMLDNAL
RFYTVAPHFV LIPGLAIFVT VLAFNLFGDG LRDAFDPRSS D
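
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation such as checking the GC content field. The following is a minimal sketch of that step. The helper names are hypothetical, and the short example string is only the first two blocks and the last line of the gene sequence, not the full gene.

```python
def strip_blocks(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Illustrative fragment only; paste the full gene sequence to check the record.
example = "ATGTCCGTGC CGATGGACGC\nGACTGA"
seq = strip_blocks(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two functions on the full gene sequence should give a length of 1026 and a GC fraction that can be compared with the GC content field in the record.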