Gene Rcas_4029 details


Gene Information

Locus tag: Rcas_4029
Symbol:
ID: 5541539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5227316
End bp: 5229421
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 60%
IMG OID: 640896141
Product: elongation factor G
Protein accession: YP_001434080
Protein GI: 156743951
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0480] Translation elongation factors (GTPases)
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00484] translation elongation factor EF-G
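
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the function names are hypothetical and only for illustration):

```python
# Coordinates copied from the record above.
# Assumption: both positions are 1-based and inclusive.
START_BP = 5227316
END_BP = 5229421


def gene_length(start, end):
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len, has_stop=True):
    """Number of amino acids encoded by a CDS of gene_len bp.

    Assumption: the CDS is a whole number of codons and, when has_stop
    is True, the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the record: 2106 bp and 701 aa.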


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00578031
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000445234
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCTCGTG AAGTGCCATT AGAGCGGATA CGAAATATTG GTATCATTGC CCATATCGAT 
GCGGGGAAGA CGACGACGAC CGAACGCATT CTGTTCTATA CCGGTCGAAC GTACAAACTT
GGCGAGGTGC ATGAAGGCAC CGCAGTGATG GACTGGATGG AACAGGAGCG TGAGCGCGGC
ATTACGATCA CCGCTGCTGC CACTACTGCT GAGTGGACGG TCGAAGGAAC GCCGTATCGC
ATCAATATCA TCGACACACC TGGTCACGTC GACTTTACGG CGGAGGTCGA GCGGTCGCTG
CGCGTGCTCG ATGGCGGCGT GGTGGTGTTC GATGCCGTCG CCGGGGTCGA GCCGCAATCG
GAGACGGTGT GGCGGCAGGC GGATAAGTAC CACGTGCCGC GGATCTGCTT CGTCAACAAG
ATGGACCGGA TCGGCGCCAA CTTTATGCGC ACGGTGGACA TGATTCGAGA GCGCCTTGGC
GCAAAACCGG TGCCGGTTCA GTTCCCGATT GGCGCAGAGG ATCGCTTTCG CGGCATCGTA
GACTTGATCA CCAACAAGGC GGTCATCTAC GTCGATGATC AGGGCAAACG CGAGGAACTG
GAGGCGATCC CGGCAGATGT CGCCGACGAG GTGGAACGCC TGCGCAATGA GATGATCGAA
GCAATCGCTG AAACCGATGA TGAGTTGACG CTCCTCTATC TCGAGGGCGA AGAACTCAGC
GTCGAAGAAC TGCGCCGGGC ATTGCGCAAG GCGACGATCC AGGGCAAACT GGTGCCGGTG
CTGTGTGGCG CTGCCCTGCG CAACAAAGGG GTGCAGCGCC TGCTCGATGC GGTTGTCTAC
TATTTGCCGT CGCCGGTCGA TATTCCGCCG GTGCGCGGGA CGCGCCCAGG TCAGATTGCC
GGCGACGATG GCGTTGAGAT GATCACGCGC CCGACATCGG AAGACGCGCC GTTTACCGGT
CTGGTGTTCA AGATCGTCTC CGACCCGTTC GTCGGGAAAC TTGCGTACTT CCGGGTCTAT
TCCGGCAAGT TGGAGACCGG CTCCTACGTG CTCAATTCGA CGCGCAATCA GCGTGAACGT
ATCGGGCGCC TGCTCCAAAT GCATGCCAAC CATCGCGAGG AGATCAAGGA GGTCTACGCC
GGCGATATCG CTGCGATGGT CGGACCCAAG CAAAGCTATA CCGGCGATAC GATCTGCGAC
CCGAATGATC CGATTGTGCT GGAAAGCATC CGTTTCCCGG AGCCGGTCAT TCAACTGGCG
ATCGAACCCA AGACAAAGGC CGATCAGGAT AAACTGGCGG TCGCGCTCGG CAAACTGGCG
GAGGAAGACC CGACATTCCG TGTCTTCACC GACCCGGAGA CCGGACAGAC GATCATCGCC
GGCATGGGTG AGCTTCACCT TGAGGTGATT GTCGACCGCA TGCGCCGCGA GTATAAGGTC
GAAGCCAACC AGGGCAAGCC GCAGGTGGCG TATCGCGAGT CGATCACGGT TCCGGCGGAT
GTGGACAGTA AGTTCGTGCG GCAGAGCGGC GGCAAAGGTC AGTATGGCCA CGTTAAGTTG
CAGGTCGAAC CGCTCGAACG AGGGAAGGGG TTCGAGTTCG TCAACGGTAT CGTCGGCGGC
GTCATCCCAC GCGAGTATAT CCCGGCGGTC GAGGCAGGCG TCAAGGAAGC GATGGCAAGC
GGCGTGATCG CCGGCTACCC GGTCGTCGAT ATCAAAGTCA CGCTCTACGA TGGCTCGTAC
CACGAGGTTG ACTCATCAGA AATGGCATTC AAGATCGCCG CCTCGATGGG GCTGAAGGAA
GCGGTGCGTA AAGGACGTCC GATCCTGCTC GAACCGGTAA TGAAGGTCGA AATTGTGACG
CCGGAAGATT TTCTCGGCGC CGTTCTTGGC GATATCAACT CGCGCCGCGG TCACGTCGAG
GGCATGGAGG CGCGCGGCAA TGCGCAGGTT ATTCGGGCAT ACGTCCCGCT GGCGTCCATG
TTCGGCTATA CGACCGACCT GCGATCGGCA ACGCAGGGGC GCGCTACGTC GTCGATGGAG
TTCGCTTATT ACCAACCGCT GCCGGATGCT CTGGCAAAGG AGATCATCGA AAAACGGCGC
GGCTAG
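
The GC content reported for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the spaces and line breaks shown here; `gc_fraction` is a hypothetical helper name, not part of any database tool:

```python
def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used above) is
    ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the final six bases of the gene sequence.
print(round(gc_fraction("GGCTAG"), 3))
```

To compare with the record, pass the full gene sequence and round the result to a whole percentage.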
 
Protein sequence
MPREVPLERI RNIGIIAHID AGKTTTTERI LFYTGRTYKL GEVHEGTAVM DWMEQERERG 
ITITAAATTA EWTVEGTPYR INIIDTPGHV DFTAEVERSL RVLDGGVVVF DAVAGVEPQS
ETVWRQADKY HVPRICFVNK MDRIGANFMR TVDMIRERLG AKPVPVQFPI GAEDRFRGIV
DLITNKAVIY VDDQGKREEL EAIPADVADE VERLRNEMIE AIAETDDELT LLYLEGEELS
VEELRRALRK ATIQGKLVPV LCGAALRNKG VQRLLDAVVY YLPSPVDIPP VRGTRPGQIA
GDDGVEMITR PTSEDAPFTG LVFKIVSDPF VGKLAYFRVY SGKLETGSYV LNSTRNQRER
IGRLLQMHAN HREEIKEVYA GDIAAMVGPK QSYTGDTICD PNDPIVLESI RFPEPVIQLA
IEPKTKADQD KLAVALGKLA EEDPTFRVFT DPETGQTIIA GMGELHLEVI VDRMRREYKV
EANQGKPQVA YRESITVPAD VDSKFVRQSG GKGQYGHVKL QVEPLERGKG FEFVNGIVGG
VIPREYIPAV EAGVKEAMAS GVIAGYPVVD IKVTLYDGSY HEVDSSEMAF KIAASMGLKE
AVRKGRPILL EPVMKVEIVT PEDFLGAVLG DINSRRGHVE GMEARGNAQV IRAYVPLASM
FGYTTDLRSA TQGRATSSME FAYYQPLPDA LAKEIIEKRR G
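
The protein sequence corresponds to a codon-by-codon translation of the gene sequence with translation table 11. A minimal sketch of that step, assuming the input is the coding strand read from its first base; table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, so the standard assignment string is used here (`translate` is a hypothetical helper written for this illustration):

```python
from itertools import product

# Codon table in the compact NCBI layout: codons are enumerated with
# bases in the order T, C, A, G and mapped onto this amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Trailing bases that do not form a complete
    codon are dropped.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCTCGTG AAGTGCCATT AGAGCGGATA"))  # MPREVPLERI
```

The first ten residues produced this way match the start of the protein sequence, and the final codons of the gene (ending in the stop codon TAG) give the closing residues KRRG.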