Gene Information |
Locus tag | Sare_1921 |
Symbol | engA |
ID | 5708273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2216723 |
End bp | 2218126 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271425 |
Product | GTP-binding protein EngA |
Protein accession | YP_001536797 |
Protein GI | 159037544 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
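The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not translated (the gene sequence in this record ends in TGA). The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2216723
END_BP = 2218126


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the untranslated stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1404 467
```

Both results agree with the Gene Length and Protein Length fields listed in the record.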
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.39722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00139595 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAGTGAAC CGAACGGTTG GGTGGAGTTG GACGTCCCGG AACCCGACGC CGAGGAATTC ACAGGCCCGC AGCCGGTGGT GGCCGTGGTC GGCCGCCCCA ACGTGGGAAA GTCGACGTTG GTGAACCGTC TCATCGGCCG CCGGCAGGCG GTCGTCGAGG ACGTCCCCGG GGTGACCCGG GACCGCGTCC CGTACGACGC GCAGTGGAAC GGCCGACAGT TCGCCGTCGT CGACACCGGC GGCTGGGAAC CAGACGCGAA AGACCGCGCC GCAGCGATCG CCGCGCAGGC CGAGACGGCA GTCACCACCG CCGACGTGGT GCTGTTCGTG GTTGACGCAG TGGTGGGCGC TACCGACGTT GACGAGTCGG CGGTGAAGAT GCTGCGCCGC AGTGCCAAAC CGGTGATCCT GGTGGCGAAC AAGGCCGACA ACAGCTCCAT CGAAATGGAG GCGGCCACGC TGTGGTCACT CGGCCTGGGC GAGCCGTACC CGGTATCCGC GCTGCACGGC CGCGGCTCCG GCGAACTGCT CGATGTCATC ATGGACCGGC TACCGGAGGC ACCGAAGATC ATCGAGGACC GTCCGCGCGG CCCCCGCCGG GTCGCCCTCG TCGGTAGGCC CAACGTCGGC AAGTCCAGCC TCCTCAACCG CTTCTCCGGC GAGGTACGGG CAGTCGTTGA CGCGGTCGCC GGCACCACGG TCGACCCGGT CGACAGCCTC GTCGAGATCG GTGGTGAGGC ATGGCAACTC GTGGACACGG CCGGCCTGCG AAAGCGGGTC GGCAAGGCCA GCGGCACCGA GTACTACGCG AGCCTGCGCA CCGCCTCGGC GATCGAGGCG GCCGAGGTCG CGGTGGTCCT GCTCGACGCC AGCGAAGTCA TCAGCGAACA GGACCAGCGG ATTCTCTCGA TGGTCACCGA CGCCGGCCGG GCCCTGGTGA TCGCCTTCAA CAAGTGGGAC CTGGTTGACG CCGATCGTCG GTACTACCTT GATCGGGAGA TCGAGCGGGA ACTGCGCCGT ATCCCGTGGG CGATCCGGCT CAACCTGTCC GCCAAGACCG GCCGCGCGGT CGACAAGCTC GCCCCGGCGT TGCGTAAGGC CCTGGCCAGT TGGGAAACCC GGGTGCCGAC GGCACAACTC AACGCGTGGC TCACCGCGTT GGTGCAGGCG ACCCCACACC CCGTACGTGG GGGACGGGCC CCGAAGATTC TCTTCGCCAC CCAGGCAGGT GCGGCGCCGC CGCGGTTCGT GCTGTTCACG TCGGGGCCGT TGGACGCGGG CTACCAACGT TTCGTGGAGC GGAAACTCCG TGAGGAGTTC GGCTTCGAGG GCAGTCCGAT CGAGATCGCG GTCCGCCCCC GTAAGAAGGT CGGCCCTGGC GGTCGCGGCA AGGCCCACGG CTGA
Protein sequence | MSEPNGWVEL DVPEPDAEEF TGPQPVVAVV GRPNVGKSTL VNRLIGRRQA VVEDVPGVTR DRVPYDAQWN GRQFAVVDTG GWEPDAKDRA AAIAAQAETA VTTADVVLFV VDAVVGATDV DESAVKMLRR SAKPVILVAN KADNSSIEME AATLWSLGLG EPYPVSALHG RGSGELLDVI MDRLPEAPKI IEDRPRGPRR VALVGRPNVG KSSLLNRFSG EVRAVVDAVA GTTVDPVDSL VEIGGEAWQL VDTAGLRKRV GKASGTEYYA SLRTASAIEA AEVAVVLLDA SEVISEQDQR ILSMVTDAGR ALVIAFNKWD LVDADRRYYL DREIERELRR IPWAIRLNLS AKTGRAVDKL APALRKALAS WETRVPTAQL NAWLTALVQA TPHPVRGGRA PKILFATQAG AAPPRFVLFT SGPLDAGYQR FVERKLREEF GFEGSPIEIA VRPRKKVGPG GRGKAHG