Gene Elen_2011 details


Gene Information

Locus tag: Elen_2011
Symbol:
ID: 8416322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2357104
End bp: 2358498
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 68%
IMG OID: 645024988
Product: GTP-binding protein Obg/CgtA
Protein accession: YP_003182364
Protein GI: 257791758
COG category: [R] General function prediction only
COG ID: [COG0536] Predicted GTPase
TIGRFAM ID: [TIGR02729] Obg family GTPase CgtA
            [TIGR03595] Obg family GTPase CgtA, C-terminal extension
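The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2357104
END_BP = 2358498
GENE_LENGTH_BP = 1395
PROTEIN_LENGTH_AA = 464


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1395
    print(expected_protein_length(GENE_LENGTH_BP))  # 464
```

Under those assumptions, both derived values agree with the listed Gene Length and Protein Length.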


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000147894
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000000000362877
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTTCATCG ATAAAGTACG CATCCATGTC AAAGGCGGCA ACGGCGGTGC CGGCTGCATG 
TCATTTCGCC GCGAGGCTCA TGTGCCGAAG GGCGGTCCGG ACGGCGGCGA TGGCGGTCAC
GGCGGCAACG TTGTGGTCGA GGCCGATGCC AGCCTGTCCT CGCTCATCGA ATACCGCTTC
AAACACCATT TCAAAGCCGA GCGCGGCACG CACGGCAAGG GCTCGCGCAT GCACGGAGCC
ACCGGCGAGG ACCTTGTGCT CAAGGTGCCG ATGGGCACGG TCGTCCACGA GTATTTCGAA
GAATCGAAGG AAGTGGGCGA GCTCATCGCC GACCTCACGC ACGACGGCGA ACGCGTCACC
GTGGCCGAGG GCGGCATGGG CGGCCGCGGC AACATCCACT TCGTGACGCC GACCCGACGC
GCGCCCGCGT TCGCCGAGCT GGGCGAGCCG TCGCAGGAGC GCTGGATCGA GCTGGAGATG
AAGCTCATGG CCGACGCGGC CCTCGTGGGC ATGCCGTCGG CGGGCAAGTC GTCGCTCATC
GCTAAGATGA GCGCGGCGCG GCCGAAGATC GCCGACTACC CGTTCACCAC GCTCGTGCCG
AACCTCGGCG TGGCGCGTTC GGGGGACTAC AGCTTCGTCG TGGCCGACAT CCCCGGCCTC
ATCGAGGGCG CGCACGAAGG GCGCGGTCTG GGACACGAGT TCCTGCGCCA CATCGAGCGC
ACGGCGCTCA TCGTGCACGT GGTGGACCTG ACGGGCGACT ACGAGGGACG CGATCCGCTG
GAGGATTACG ATATCATCAA CCGCGAGCTG GCGCTGTACG CCGACGAGCT GGCGGCGCGC
CCGCGCATCG TGGTGGCGAA CAAGATCGAC GTGCCCGGCG CGGAGGAGGT CGCCGACCGG
CTGGCCGAGC GCGTGCGCGA GGACTCGATC GCGGCAGCGG GCGGCGACGA GTTCGCCCCG
AGCCCCGTCG ATCCGAAGCT CTACCGCATC AGCGCGCTCA CGGGCGAGGG CGTCGACGGC
CTCAAGGCCG CCATCGCGAC CAAGGTGCAC GAGCTGCGCG AGGAGCTGCG CGCGCTTTCG
GAGGCCGACG TGCAGTACGA GCACGTGTGG GAGCACAAGC GCGAGGAACG CGACAAGCAG
TTCAAGGTCG TGCCGCTCGG CGGCGGGGTG TTCCGCGTCG AGGGCCCGCA GGTGGAGCGC
ATGGTGGTGC AGACCGACTG GGAGAACGAA GAAGCCATCG CGTTTTTGCA GCACCGCCTC
AAGCGCCTCG GCGTGGAGAA GGCGCTTGAG AAGGCGGGCG CCGTGGACGG CGACGAGATC
CGCATTGTCG GCCGAGCGTT CGAATTCGAG TCGGTTCGCA CGGCGGAGGA TCTGTTCAAG
GAGCTCGACC TGTGA
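
The gene sequence is printed in space-separated blocks of 10 bases, 60 per line. As a rough sketch of how that layout could be collapsed into one string and a GC percentage recomputed for comparison with the GC content field, the helpers below may be useful; `parse_blocked_sequence` and `gc_percent` are hypothetical names chosen for illustration.

```python
# First line of the gene sequence, copied verbatim from this record.
FIRST_LINE = "GTGTTCATCG ATAAAGTACG CATCCATGTC AAAGGCGGCA ACGGCGGTGC CGGCTGCATG"


def parse_blocked_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    line = parse_blocked_sequence(FIRST_LINE)
    print(len(line))  # 60 bases on a full line
    print(round(gc_percent(line), 1))
```

Pasting the full listing into `parse_blocked_sequence` and passing the result to `gc_percent` gives a value that can be compared with the rounded figure in the record.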
 
Protein sequence
MFIDKVRIHV KGGNGGAGCM SFRREAHVPK GGPDGGDGGH GGNVVVEADA SLSSLIEYRF 
KHHFKAERGT HGKGSRMHGA TGEDLVLKVP MGTVVHEYFE ESKEVGELIA DLTHDGERVT
VAEGGMGGRG NIHFVTPTRR APAFAELGEP SQERWIELEM KLMADAALVG MPSAGKSSLI
AKMSAARPKI ADYPFTTLVP NLGVARSGDY SFVVADIPGL IEGAHEGRGL GHEFLRHIER
TALIVHVVDL TGDYEGRDPL EDYDIINREL ALYADELAAR PRIVVANKID VPGAEEVADR
LAERVREDSI AAAGGDEFAP SPVDPKLYRI SALTGEGVDG LKAAIATKVH ELREELRALS
EADVQYEHVW EHKREERDKQ FKVVPLGGGV FRVEGPQVER MVVQTDWENE EAIAFLQHRL
KRLGVEKALE KAGAVDGDEI RIVGRAFEFE SVRTAEDLFK ELDL
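
The gene begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiation codon under translation table 11. The sketch below illustrates that relationship in plain Python. It assumes the amino acid assignments and the set of initiation codons of NCBI genetic code 11 as commonly documented; `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
from itertools import product

# Codon assignments in TCAG order (assumed to match NCBI genetic code 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still encode Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    first_line = "GTGTTCATCG ATAAAGTACG CATCCATGTC AAAGGCGGCA ACGGCGGTGC CGGCTGCATG"
    print(translate_cds(first_line))          # MFIDKVRIHVKGGNGGAGCM
    print(translate_cds("GAGCTCGACC TGTGA"))  # ELDL
```

The two printed fragments correspond to the first 20 and last 4 residues of the protein sequence listed above.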