Gene Acid345_4340 details

Gene Information

Locus tag: Acid345_4340
Symbol:
ID: 4071758
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5149378
End bp: 5150733
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 61%
IMG OID: 637986373
Product: GTP-binding protein, HSR1-related
Protein accession: YP_593414
Protein GI: 94971366
COG category: [R] General function prediction only
COG ID: [COG2262] GTPases
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR03156] GTP-binding protein HflX
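
The start and end coordinates, gene length, and protein length listed above agree with one another if the coordinates are read as inclusive positions and the final codon is a stop codon. Both points are assumptions the figures support, not statements made by the record. A minimal Python sketch of that arithmetic, with variable names chosen here for illustration:

```python
# Values copied from the record above.
start_bp = 5149378
end_bp = 5150733

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1356 bp
print(protein_length, "aa")   # 451 aa
```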


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.436695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.136604
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCCAGG AGCGAGCGTT CCTCGTTGGA GTGGAATTCA CGAGCCGGCG GTTCAAGAAT 
GGACCGTCGG ATCGCAAATC TGAAGCAGCC TCACCGATTC CTCCCTCGGC GAAGATCGCG
CGGAGCGCTG CGTCGTCAGC GCCGGTGGCC GACGCTCATT CTGAAGTTCC TCCTGAAGAT
GTTCTCGATG AACTGCGCGA GCTAACGCTG AGCGCGGGCG CCGGTGTGGT CGGCGAAATG
TTGCAGCGGC GCGACCGGCC GGACTCCGCA ACGCTGATTG GCAGCGGCAA ACTGGAAGAG
TTACAGGGTG CCGTGGCTGC GTCGGATGCG GACCTGGTGA TCTTCGATCA CGATCTGTCG
CCATCGCAGC AGAGAAATAT TGAGCGCGCG CTCGAGGCGC GGGTCATCGA TCGCACGCAG
TTGATCCTCG ACATTTTTGC GAAGCACGCG CGGACGGCAG AAGGACAACT GCAGGTCGAG
CTGGCGCAGT TGCAATACCT GCTGCCGAGG CTGGGCGGGC GCGGTATCGA GATGTCGCAG
CTGGGTGGCG GCATTGGCAC GAGAGGCCCG GGTGAGACGC AGTTGGAAAC CGATCGCCGG
AAGATCAATC GGCGGATTCG TCATGTGCAG AAGCAGCTGG AAGATGTGCG GCGAATCCGG
CGGCAGCAGC GCGCGCGACG GGAGAATGTG CCGGTCGCGG TGGTGGCGCT GGTGGGATAC
ACGAACGCAG GGAAGTCGAC GCTGTTCAAC GCACTGACCA AGGCGGGAGT GTATGCGTCG
TCGAAGATGT TTGCGACGCT GGACCCGACA CTGCGCGGAG TGATGTTGCC GTCGAAGCGG
CAGGTTTTGT TGAGCGACAC GGTGGGATTC ATTCGGAATT TGCCGACGAC GCTGGTATCG
GCATTCCGGG CGACGCTGGA AGAAGTCCAG CGGGCGGCGT TGTTGCTGCA TGTGGCGGAT
GCGACGTCCC CGGTGGCGCT GGAGCAGCAG CGGCAGGTGG AAGACGTGCT CGGGGAATTA
GAAGTGCAGG ATAAGCCGCA GATTCACGTG ATGAACAAGA TTGACCTGCT GGCGACGTCG
AAGCGAGCGG CGCTTATCAA CTCGGGCAAA GTTGTGCATG TGTCGGCGAA GAGCGGCCTT
GGGATGGAAG CGCTTCTTCA TGCGATTGAT GAGGCGATCA CGGAAGACCC GGTGAAGACG
GCGCGGTTGA AGATTCCACA GGCGGATGGG AAGGCGTTGT CGTTGGTGGA GGCGAAGGCG
CGGGTGAAGA AGAGGAGCTA TCGCGGGAGC AATGTGCATT TAGAGGTCGA GGCGCCGGAG
AGTGTTTTGA GGAAGCTCGG GGAGTATGTG GTTTGA
 
Protein sequence
MAQERAFLVG VEFTSRRFKN GPSDRKSEAA SPIPPSAKIA RSAASSAPVA DAHSEVPPED 
VLDELRELTL SAGAGVVGEM LQRRDRPDSA TLIGSGKLEE LQGAVAASDA DLVIFDHDLS
PSQQRNIERA LEARVIDRTQ LILDIFAKHA RTAEGQLQVE LAQLQYLLPR LGGRGIEMSQ
LGGGIGTRGP GETQLETDRR KINRRIRHVQ KQLEDVRRIR RQQRARRENV PVAVVALVGY
TNAGKSTLFN ALTKAGVYAS SKMFATLDPT LRGVMLPSKR QVLLSDTVGF IRNLPTTLVS
AFRATLEEVQ RAALLLHVAD ATSPVALEQQ RQVEDVLGEL EVQDKPQIHV MNKIDLLATS
KRAALINSGK VVHVSAKSGL GMEALLHAID EAITEDPVKT ARLKIPQADG KALSLVEAKA
RVKKRSYRGS NVHLEVEAPE SVLRKLGEYV V
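
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Translation table 11 (the bacterial, archaeal and plant plastid code) uses the standard codon assignments but accepts alternative start codons such as GTG, and a start codon is read as methionine. The sketch below illustrates this on the first and last fragments of the gene sequence. It is a simplified illustration, not the database's own pipeline: the helper names `clean`, `gc_fraction` and `translate` are hypothetical, and `START_CODONS` lists only three of the start codons that table 11 allows.

```python
# Standard codon assignments, which table 11 shares; bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence above.
head = translate("GTGGCCCAGG AGCGAGCGTT CCTCGTTGGA")
tail = translate("AGTGTTTTGA GGAAGCTCGG GGAGTATGTG GTTTGA")
print(head)  # MAQERAFLVG
print(tail)  # SVLRKLGEYVV
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way is one way to cross-check the listed protein sequence and the reported GC content.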