Gene Acid345_0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0555 
Symbol 
ID4073044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp683108 
End bp684250 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content57% 
IMG OID637982560 
Producthypothetical protein 
Protein accessionYP_589634 
Protein GI94967586 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.11052 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGACG ATCCCATCGA TTTTGCGCCT CAGACCGAAG ATGGCGTTCC TGAAATCCCG 
CAGGTGGCGT TGGACCGTTC GACAGCCGAA GAACTCGAAC TGCGCAAGAT GCAGAAGACG
CAGACGACGT GCCTCTTCAT TCTCGCGCTC GCCACCGGTC TTGGACTCGC GTACGTGGCG
AAGATGGTGC TGGTGGTGCT GTTCGTTTCG ATCCTGGTGG CGTTCGTGCT CGCGCCCGTG
GTCGATTTCG GCGTCCGCTT TGGCGTTCCG CGGTCGCTGG CTTCGTTGTT CGCAGTTTTC
CTCTTGATCG GTGTGATCTA CGGCATCACG TTCATGTCGT ACAGCAAAGG CGTGGACTTC
ATGCAGGACC TGCCCAAGTA CAGCCGGCGC ATACGTGAGG TTGGCCAGCG ATGGGAACGG
CGGGCGGAAG CATTTCGGAG GAGTACCCAG GATATCGTTC CGCAATCGGA AGATGACAAG
AGGGCGGTTA CTATCCGCCA GCAGTCGAGC CTGTCCGACA CAGTTACAAC CGCTCTCGGC
TCCGTGGGCG AAATCTTCTT CACCATCAGC TTTATTCCGT TCCTTGCGTT TTTCATGTTG
AGTTGGCAGG AGCACGTCAG GTCGGCGACG GTGATGCTGT TCAAGATGGA GAACCGCAAC
ACCGCTTACG TCACGCTGGG GCTGATGTCG AGCATGATCC GAAGTTTTCT GGTCGGGAAC
CTGCTCGTCG GGCTCTTTGT CAGCGTAGTG AGCATGATCA TCTTCGGACT GTTGGGAGTG
CCGTTCTTCT ACTTCGTGGG CTTCATCAGC GGGTATCTCA GCATGGTGCC GTACTTGGGC
ATCGTGCTGG CACTCATCCC GCCGGTCATC ACCGGAATGG GAGTGATGAG CCTTGAGAAG
CTGGTAGTGA TCATCGTTTC TATCCTGAGC TTGCACCTGT TCGCGATGAA CGTGCTCTAC
CCGAAGGTGC TCGGGAAACG GCTGCAACTC AATCCGCTGA CGGTGACGAT CGCGCTGTTG
TTCTGGGGTT GGTTGTGGGG CGCGATGGGA TTGATCCTCG CGATCCCGGT TACGGCCGCG
ATCAAGATCG TACTCGACCA CGTGGAAGGA TTTGAGGGCT ACGGGCAGTG GATGGGCGAG
TGA
 
Protein sequence
MEDDPIDFAP QTEDGVPEIP QVALDRSTAE ELELRKMQKT QTTCLFILAL ATGLGLAYVA 
KMVLVVLFVS ILVAFVLAPV VDFGVRFGVP RSLASLFAVF LLIGVIYGIT FMSYSKGVDF
MQDLPKYSRR IREVGQRWER RAEAFRRSTQ DIVPQSEDDK RAVTIRQQSS LSDTVTTALG
SVGEIFFTIS FIPFLAFFML SWQEHVRSAT VMLFKMENRN TAYVTLGLMS SMIRSFLVGN
LLVGLFVSVV SMIIFGLLGV PFFYFVGFIS GYLSMVPYLG IVLALIPPVI TGMGVMSLEK
LVVIIVSILS LHLFAMNVLY PKVLGKRLQL NPLTVTIALL FWGWLWGAMG LILAIPVTAA
IKIVLDHVEG FEGYGQWMGE