Gene Acid345_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0954 
Symbol 
ID4070836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1213617 
End bp1214696 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content63% 
IMG OID637982961 
Producthypothetical protein 
Protein accessionYP_590031 
Protein GI94967983 
COG category[R] General function prediction only 
COG ID[COG0701] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.416811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCGA GCGCCTCTGC CGCACCCGCT TTTGCTGTCG CCGATGTCCC GAAGAAATCT 
GCTGTCACCA CTGCGCAGAT CGTTACCGTC ATCTTCCTCG GCCTCGCCAT GGCGCTCTAT
TTCTGGGTTG ATTCGCGATA CCCCTCATTG ATGAAGAAGT ACCACGCCGG CCACGCGGTG
AAGGCTGCGG GCGCCATCAG CTTCGACGCG ATCCTTCCCG TCAACCCAAC CATGCCGCTG
ACGACACGCA TCGTGCGTAC CTCCGGCAAT TGGCTCTATA CGAACCGCAT CGGGATGAGC
TTCGGTATGG GGTTCGGCGC GTTGCTGCTC ACGCTCCTGC CCATGTTCGC GCGCCGCCGG
TTCAAAAGCG GATTTGCCAA TACCGTGCTC GGCGTCGCAG CGGGTGCGCC GCTCGGCGTG
TGCGCCAATT GCGTCGCGCC CATCGGACGC GGCCTGGTGC AGGCCGGCGC TAGTCCCAAC
ACCGCGCTCG CGACCATGAT CAGTTCGCCC ACGCTTAACG TTGTCGTGCT GGCGATGGCA
TTCAGCCTGT TCCCGCTGCC GGTTGCCATC ACCAAGATTG CGACCGTGCT CGCGCTGCTC
GCGCTGGTGC CGTGGTTTGC GCCAAAGCCC GAGCCGGAAT TCGCCTGTGA GATTCCGCAA
TCGGCAGCCG CCGGATCGGC CGTAGTGCTC TTCCTCAAGA ACCTTGCGAA GATGATCGCG
ATCACGCTGC CGTTCATGGT GCTCGCCGGC GTTCTCGGCG CAATCCTCGC CGAAGCCCTG
CCATCGAGCA GCCTGCCCGC GCACGTTTCG ATTCTCGGAA TCATTCTTGT CGCGCTGATC
GGCGCATTCC TGCCGGTACC GATGGCTTTC GACGTCGCAA TCGCGTTCGT ACTGATGTCG
CGCGGGGTGG CGCTGCCCTA TGTCGTGACG CTACTCTGCA CCCTCGGCTG CTTCAGCATT
TATTCGGCGC TGATCGTGGG CAAGAGCCTG TCGTGGAAGA CCGCCGGCAA GATGTACGGC
ACGGTGGCCG CGCTGGGAAT CGTCGCGGGA TTGGTGACCG CGGCGTGGAG CGGATTCTAG
 
Protein sequence
MSSSASAAPA FAVADVPKKS AVTTAQIVTV IFLGLAMALY FWVDSRYPSL MKKYHAGHAV 
KAAGAISFDA ILPVNPTMPL TTRIVRTSGN WLYTNRIGMS FGMGFGALLL TLLPMFARRR
FKSGFANTVL GVAAGAPLGV CANCVAPIGR GLVQAGASPN TALATMISSP TLNVVVLAMA
FSLFPLPVAI TKIATVLALL ALVPWFAPKP EPEFACEIPQ SAAAGSAVVL FLKNLAKMIA
ITLPFMVLAG VLGAILAEAL PSSSLPAHVS ILGIILVALI GAFLPVPMAF DVAIAFVLMS
RGVALPYVVT LLCTLGCFSI YSALIVGKSL SWKTAGKMYG TVAALGIVAG LVTAAWSGF