Gene Sare_4160 details

Gene Information

Locus tag: Sare_4160
Symbol:
ID: 5707709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4725072
End bp: 4726157
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 68%
IMG OID: 641273587
Product: GTP-dependent nucleic acid-binding protein EngD
Protein accession: YP_001538940
Protein GI: 159039687
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0012] Predicted GTPase, probable translation factor
TIGRFAM ID: [TIGR00092] GTP-binding protein YchF
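
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though the listed numbers are consistent with it) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4725072
END_BP = 4726157
GENE_LENGTH_BP = 1086
PROTEIN_LENGTH_AA = 361


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 1086 bp, which is 362 codons, or 361 residues once the stop codon is excluded.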


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000134834
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAGTCTCA CCATCGGCAT CGTCGGCCTG CCCAACGTCG GCAAGAGCAC CCTGTTCAAC 
GCGCTGACCA AGAACGACGT GCTCGCGGCG AACTACCCCT TCGCCACCAT CGAGCCCAAC
GTCGGCGTGG TCGGCCTGCC GGACGAGCGG CTGGGTAAGC TCGCCGAGAT CTTCGACTCG
CAGAAGGTGA TCCCCGCGCC GGTGTCGTTC GTCGACATCG CCGGGCTGGT CCGGGGCGCC
TCGAAGGGCC AGGGGCGGGG CAACGCGTTC CTCGCCAACA TCCGGGACGC CGCCGCGATC
TGTCAGGTCG TTCGTGCCTT CTCCGACCCG AACGTGGTGC ACGTCGACGG CAAGGTCGCT
CCCGCGGACG ACATCGAGAC CATCAACACG GAGCTGATCC TCGCCGACCT GCAGACGCTG
GAGCGGGCGA TTCCCCGGCT GGAGAAGGAG GCCAAGCTCC GCAAGGACCG GGCCGCCGCC
CTGGCCGCCG CCAAGGCCGC CGTGGAGGTC CTCGACAACG GCACCACCCT GTACGCGGGC
GCCGCTGCCG CCGGTGTCGA GCTGGAGCAT CTGCGCGAGC TGCATCTGCT GACCACCAAG
CCCTTCCTGT ACGTGTTCAA CGTTGACGAG GCCGAACTGG CCAACGCCGA GTTCCTCGAC
GAGCTGCGGG CCCTGGTCGC CCCCGCCGAG GCTGTCTTCA TGGACGCAAA GATCGAATCG
GAGCTGGTAG ATCTGCCCGA AGAGGAGGCC CGCGAGCTAC TGGAGTCGAT CGGGCAGTCC
GAGCCGGGAC TGGACCAGCT CGTCCGGGTC GGCTTCCGCA CGCTGGGGCT CCAGACGTAC
CTCACCGCCG GACCCAAGGA GGCGCGGGCC TGGACCGTGC ACGTCGGGGC GACCGCCCCG
GAAGCCGCCG GGGTTATCCA CTCCGACTTC CAGCGCGGCT TCATCAAAGC TGAGGTCGTC
TCCTACAACG ACCTGCTCGA GGCGGGATCG ATGAGCGCCG CGAAGGCGGT GGGCAAGGTC
CGCATCGAGG GCAAGGACTA CGTCATGCAG GACGGCGACG TGGTGGAGTT CCGCTTCAAC
GTCTGA
 
Protein sequence
MSLTIGIVGL PNVGKSTLFN ALTKNDVLAA NYPFATIEPN VGVVGLPDER LGKLAEIFDS 
QKVIPAPVSF VDIAGLVRGA SKGQGRGNAF LANIRDAAAI CQVVRAFSDP NVVHVDGKVA
PADDIETINT ELILADLQTL ERAIPRLEKE AKLRKDRAAA LAAAKAAVEV LDNGTTLYAG
AAAAGVELEH LRELHLLTTK PFLYVFNVDE AELANAEFLD ELRALVAPAE AVFMDAKIES
ELVDLPEEEA RELLESIGQS EPGLDQLVRV GFRTLGLQTY LTAGPKEARA WTVHVGATAP
EAAGVIHSDF QRGFIKAEVV SYNDLLEAGS MSAAKAVGKV RIEGKDYVMQ DGDVVEFRFN
V
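
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes translation table 11 (the bacterial, archaeal and plant plastid code), which uses the standard codon assignments but reads an alternative initiation codon such as GTG as methionine when it opens the CDS. The helper `translate` and the constant names are hypothetical, and only the first 60 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 shares these.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGAGTCTCA CCATCGGCAT CGTCGGCCTG CCCAACGTCG GCAAGAGCAC CCTGTTCAAC"
print(translate(first_line))  # MSLTIGIVGLPNVGKSTLFN
```

The output matches the first 20 residues of the protein sequence. The leading GTG codon would normally encode valine, so the initial M depends on the start-codon rule.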