Gene Ndas_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0133 
Symbol 
ID9243964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp162716 
End bp163801 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content69% 
IMG OID 
ProductGTP-binding protein YchF 
Protein accessionYP_003678089 
Protein GI297559115 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCTGT CTATCGGGAT CGTCGGCCTG CCCAACGTCG GCAAGTCCAC CCTCTTCAAC 
GCGCTGACCA AGAACGACGC CCTGGCCGCG AACTACCCGT TCGCGACCAT CGAGCCCAAC
GTCGGGGTCG TGGGCGTTCC CGACGCCCGG CTCGGCAAGC TGGCCGAGAT CTTCGGCTCG
GCGAAGGTCA TTCCGGCGAC GGTGGACTTC GTCGACATCG CGGGCATCGT GCGCGGCGCC
TCCACGGGTG AGGGCCTGGG CAACAAGTTC CTGGCCAACA TCCGCGAGAG CGACGCCATC
TGCCAGGTGA TCCGGGCCTT CGACGACCCC GACGTCACGC ACGTCGACGG CGACGTCGAG
CCCTCCCGCG ACATCGAGAC CATCAACACC GAGCTGATCC TGGCCGACCT CCAGACCCTG
GAGAAGGCGC TTCCCCGCCT GGAGAAGGAC GCCAAGCGCA ACGCCAAGGA CAAGGACGCC
CAGGAGCTGC TCCAGGCCGC CCGCGACGCG CAGCAGGTCC TCGACGGCGG CACCTCGCTG
TCCGCGGCCG AGGGCGTGGA CCTGGACCGG CTGCGCGAGC TGAGCCTGCT CACGGTCAAG
CCGTTCATCT ACGTGTTCAA CCTCGACACC GACGAGCTGG CCGACGGGGC GCTGCGCACC
AAGCTCCAGG ACCTCGTCGC CCCGGCCGAG GCGATCTTCC TGGACGCCAA GATCGAGGCG
GAACTGGCCG AGCTGGACGA GGACGAGGCG CAGGAGCTGC TGGAGTCCAT GGGCCAGACC
GAGTCGGGCC TGGCCCAGCT CGCCCGGGTC GGCTTCGCCA CCCTGGGCCT GCAGACCTAC
CTGACCGCCG GGCCCAAGGA GGCCCGCGCC TGGACGATCC GCAAGGGCGC CACCGCCCCC
GAGGCCGCCG GAGTCATCCA CACCGACTTC CAGCGCGGCT TCATCAAGGC CGAGGTGGTC
TCCTTCGACG ACCTGGTCGC CGCGGGCGAC ATGCAGACCG CCCGCGCGGC GGGCAAGGTC
CGCATGGAGG GCAAGGAGTA CGTGATGGCC GACGGAGACG TCGTGGAGTT CCGCTTCAAC
GTCTGA
 
Protein sequence
MSLSIGIVGL PNVGKSTLFN ALTKNDALAA NYPFATIEPN VGVVGVPDAR LGKLAEIFGS 
AKVIPATVDF VDIAGIVRGA STGEGLGNKF LANIRESDAI CQVIRAFDDP DVTHVDGDVE
PSRDIETINT ELILADLQTL EKALPRLEKD AKRNAKDKDA QELLQAARDA QQVLDGGTSL
SAAEGVDLDR LRELSLLTVK PFIYVFNLDT DELADGALRT KLQDLVAPAE AIFLDAKIEA
ELAELDEDEA QELLESMGQT ESGLAQLARV GFATLGLQTY LTAGPKEARA WTIRKGATAP
EAAGVIHTDF QRGFIKAEVV SFDDLVAAGD MQTARAAGKV RMEGKEYVMA DGDVVEFRFN
V