Gene Acid345_1818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1818 
Symbol 
ID4070505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2198124 
End bp2199461 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content58% 
IMG OID637983827 
Producthypothetical protein 
Protein accessionYP_590893 
Protein GI94968845 
COG category[R] General function prediction only 
COG ID[COG2403] Predicted GTPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG TATTGATCCT CGGCGCCGCC GGCAGGGACT TCCACAACTT CAACGTTGTC 
TTCCGTCATA ACCCCGAATT TCAGGTCGTA GCCTTCACCG CAACCCAGAT CCCCGACATT
GCCAACAAGA CCTATCCCGC AGTGTTAGCA GGGCCGCACT ATCCTAAAGG GATCCCGATT
CTTGAAGAAG ACCAGATGGA AAAGATCATT CGCGAGCAGA GCGTGGATGT GGTGGTCTTC
TCCTATAGCG ATGTGAAGCA CGAAACCCTG ATGCACCTGG CCTCGCGGGC TGTGGCTGCC
GGCGCTGATT TCTGGTTGCT GGGTGCGGAA CGCACCGAGT TGAAGTCGAA GGTTCCCGTC
ATTTCCGTCT GCGCTGTCCG CACCGGATGC GGAAAGAGCC CGGTATCGCG CAAGATCGCG
CAGGAACTTA AGAAGCACGG CTGGAAGCCG GTCGTGATTC GCCACCCCAT GCCGTATGGT
GACCTCGCGA AGCAGACCTC GCAGCGTTTT GCCACCATGG AAGACCTGGT GAAGCATGAT
TGCACGATTG AGGAGCGGGA AGAGTACGAA CCGCACATCA TGGAAGGCCG CGTTCTGTAT
GCAGGCGTGG ATTACGAAGA GATCATGCGG AACGCCGAGA ACGAGGGCGA CATCATCCTG
TGGGACGGCG GCAACAACGA CACTCCGTTC TATAAGAGCG ACCTCGAGAT CGTTGTTCTT
GATCCGCATC GTCCGGGCCA CGAGCTGACC TACTATCCTG GCGAAGTGAA CTTCCGCGCC
GGCACCGTGC TGGTGATCAA TAAGGTTGAT ACGGCGAACC GCGAGAACAT CGAGATCGTT
AAGAAGAACA TCGCGAAGTT CAATCCGACC GCCCAGGTGA TTGAGACTGC ATGCAAGGTC
ACGATTCCGG CAGCGGAAGA AGTTCGCGGC AAGCGCGTGC TAGTGGTAGA AGACGGTCCG
ACGCTTACCC ACGGCGAAAT GCCCTACGGC GCTGGCGTGG TGGCAGCGAA GAACTACGGC
GCTGTGACAA TGGTCGATCC CCGTCCCTTC GCGGTGGGCT CGATCAAGAA GACCTTTGAG
AAGTTCCCGC ATCTCGGACG CGTTCTGCCT GCCATGGGCT ATAGCGACCA GCAGCGGCAC
GAACTCGAAC TGACGATCGC CAACACGCCG TGCGATCTGG TGCTGATTGC GACGCCGATT
GACTTGGCAA AGGCGATCAA GATCGAGAAG CCAACGCTCC GCGTGAGCTA CGAAGCTGTC
GAGATGGGAG GACACGAACT GACCGACCTG ATCGATAAAT TTACCGCTGA GCACAAGGGC
GTGGCGGTGG GGAGATAG
 
Protein sequence
MKKVLILGAA GRDFHNFNVV FRHNPEFQVV AFTATQIPDI ANKTYPAVLA GPHYPKGIPI 
LEEDQMEKII REQSVDVVVF SYSDVKHETL MHLASRAVAA GADFWLLGAE RTELKSKVPV
ISVCAVRTGC GKSPVSRKIA QELKKHGWKP VVIRHPMPYG DLAKQTSQRF ATMEDLVKHD
CTIEEREEYE PHIMEGRVLY AGVDYEEIMR NAENEGDIIL WDGGNNDTPF YKSDLEIVVL
DPHRPGHELT YYPGEVNFRA GTVLVINKVD TANRENIEIV KKNIAKFNPT AQVIETACKV
TIPAAEEVRG KRVLVVEDGP TLTHGEMPYG AGVVAAKNYG AVTMVDPRPF AVGSIKKTFE
KFPHLGRVLP AMGYSDQQRH ELELTIANTP CDLVLIATPI DLAKAIKIEK PTLRVSYEAV
EMGGHELTDL IDKFTAEHKG VAVGR