Gene Sare_1921 details


Gene Information

Locus tag: Sare_1921
Symbol: engA
ID: 5708273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2216723
End bp: 2218126
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 69%
IMG OID: 641271425
Product: GTP-binding protein EngA
Protein accession: YP_001536797
Protein GI: 159037544
COG category: [R] General function prediction only
COG ID: [COG1160] Predicted GTPases
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA
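
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows that relationship; it assumes 1-based inclusive coordinates and a single terminal stop codon, and the function name `cds_lengths` is hypothetical (it is not part of any database API).

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(2216723, 2218126))  # prints (1404, 467)
```

Under these assumptions the result agrees with the reported gene length of 1404 bp and protein length of 467 aa.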


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.39722
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00139595
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTGAAC CGAACGGTTG GGTGGAGTTG GACGTCCCGG AACCCGACGC CGAGGAATTC 
ACAGGCCCGC AGCCGGTGGT GGCCGTGGTC GGCCGCCCCA ACGTGGGAAA GTCGACGTTG
GTGAACCGTC TCATCGGCCG CCGGCAGGCG GTCGTCGAGG ACGTCCCCGG GGTGACCCGG
GACCGCGTCC CGTACGACGC GCAGTGGAAC GGCCGACAGT TCGCCGTCGT CGACACCGGC
GGCTGGGAAC CAGACGCGAA AGACCGCGCC GCAGCGATCG CCGCGCAGGC CGAGACGGCA
GTCACCACCG CCGACGTGGT GCTGTTCGTG GTTGACGCAG TGGTGGGCGC TACCGACGTT
GACGAGTCGG CGGTGAAGAT GCTGCGCCGC AGTGCCAAAC CGGTGATCCT GGTGGCGAAC
AAGGCCGACA ACAGCTCCAT CGAAATGGAG GCGGCCACGC TGTGGTCACT CGGCCTGGGC
GAGCCGTACC CGGTATCCGC GCTGCACGGC CGCGGCTCCG GCGAACTGCT CGATGTCATC
ATGGACCGGC TACCGGAGGC ACCGAAGATC ATCGAGGACC GTCCGCGCGG CCCCCGCCGG
GTCGCCCTCG TCGGTAGGCC CAACGTCGGC AAGTCCAGCC TCCTCAACCG CTTCTCCGGC
GAGGTACGGG CAGTCGTTGA CGCGGTCGCC GGCACCACGG TCGACCCGGT CGACAGCCTC
GTCGAGATCG GTGGTGAGGC ATGGCAACTC GTGGACACGG CCGGCCTGCG AAAGCGGGTC
GGCAAGGCCA GCGGCACCGA GTACTACGCG AGCCTGCGCA CCGCCTCGGC GATCGAGGCG
GCCGAGGTCG CGGTGGTCCT GCTCGACGCC AGCGAAGTCA TCAGCGAACA GGACCAGCGG
ATTCTCTCGA TGGTCACCGA CGCCGGCCGG GCCCTGGTGA TCGCCTTCAA CAAGTGGGAC
CTGGTTGACG CCGATCGTCG GTACTACCTT GATCGGGAGA TCGAGCGGGA ACTGCGCCGT
ATCCCGTGGG CGATCCGGCT CAACCTGTCC GCCAAGACCG GCCGCGCGGT CGACAAGCTC
GCCCCGGCGT TGCGTAAGGC CCTGGCCAGT TGGGAAACCC GGGTGCCGAC GGCACAACTC
AACGCGTGGC TCACCGCGTT GGTGCAGGCG ACCCCACACC CCGTACGTGG GGGACGGGCC
CCGAAGATTC TCTTCGCCAC CCAGGCAGGT GCGGCGCCGC CGCGGTTCGT GCTGTTCACG
TCGGGGCCGT TGGACGCGGG CTACCAACGT TTCGTGGAGC GGAAACTCCG TGAGGAGTTC
GGCTTCGAGG GCAGTCCGAT CGAGATCGCG GTCCGCCCCC GTAAGAAGGT CGGCCCTGGC
GGTCGCGGCA AGGCCCACGG CTGA
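
The GC content reported for this gene can in principle be recomputed from the sequence above. A minimal sketch follows, assuming the sequence is pasted as a string with the same space and line-break grouping; `gc_content` is a hypothetical helper name, and how the source database rounds its percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first ten bases of the gene only; pass the full
# sequence to compare against the value reported in the record.
print(f"{gc_content('ATGAGTGAAC'):.1f}%")
```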
 
Protein sequence
MSEPNGWVEL DVPEPDAEEF TGPQPVVAVV GRPNVGKSTL VNRLIGRRQA VVEDVPGVTR 
DRVPYDAQWN GRQFAVVDTG GWEPDAKDRA AAIAAQAETA VTTADVVLFV VDAVVGATDV
DESAVKMLRR SAKPVILVAN KADNSSIEME AATLWSLGLG EPYPVSALHG RGSGELLDVI
MDRLPEAPKI IEDRPRGPRR VALVGRPNVG KSSLLNRFSG EVRAVVDAVA GTTVDPVDSL
VEIGGEAWQL VDTAGLRKRV GKASGTEYYA SLRTASAIEA AEVAVVLLDA SEVISEQDQR
ILSMVTDAGR ALVIAFNKWD LVDADRRYYL DREIERELRR IPWAIRLNLS AKTGRAVDKL
APALRKALAS WETRVPTAQL NAWLTALVQA TPHPVRGGRA PKILFATQAG AAPPRFVLFT
SGPLDAGYQR FVERKLREEF GFEGSPIEIA VRPRKKVGPG GRGKAHG
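
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that correspondence with the standard codon assignments, which table 11 shares for internal codons; it does not handle the alternative start codons that table 11 permits (not needed here, since the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order. Assumption: table 11 uses
# the same assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGAAC CGAACGGTTG GGTGGAGTTG"))  # prints MSEPNGWVEL
```

The output matches the first ten residues of the protein sequence listed above.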