Gene Acid345_0854 details


Gene Information

Locus tag: Acid345_0854
Symbol:
ID: 4070987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1064014
End bp: 1065156
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 57%
IMG OID: 637982863
Product: hypothetical protein
Protein accession: YP_589933
Protein GI: 94967885
COG category:
COG ID:
TIGRFAM ID:
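
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below shows that relationship; the function name `cds_lengths` is hypothetical, and it assumes inclusive coordinates, an unspliced CDS, and a gene length that counts the terminal stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: start and end are both inclusive, so add 1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and codes for no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(1064014, 1065156))  # (1143, 380)
```

For this record the result agrees with the listed 1143 bp and 380 aa.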


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGATA GCGTTCAACT CAACCTGATG ATGGGACCGT TTTTCCCTAT GACGCCGCCG 
CGCCCAGTCA TGGACGCCCT CGACAGTGTT GAGGTGACGG TGAATGACAC CGGCACAAGC
GGATTCCAGC TTACGTTCTT GATCGACAAA CAATCGCCGC TCAACATCAT GTTCTTGCTC
ACGGGTGGAC TCCCACTTCT ATTCATGCGG GTTGTTATCG TCGCCATCGT GAATGGAGTC
TCGAATGTCC TGATCGACGG AGTCATCACC AACAACCACA TCTCTCCCGG AGACAAGGGC
TCGAACTCGA CACTGACCCT GACCGGCGAA GATCTCACCG CGCTCATGAA CCAGTCCGAT
TGGAGCGGTT TCCCCTTCCC GGCCTGCCCT GCGGAAGCGC GCGTCGCTCT CATCTGCGCG
AAGTATGCGA TTTTCGGCGT CATTCCCTTG ATCATCCCCA GCGTGTTAAT CGACGTGCCG
TTGCCGATCG ACATGATCCC CAGCCAACAG GGCACCGATC TCGCCTACGT TCGCGCGCTC
GCCGATCGCG TCGGATACGT CTTCTATATC GATCCCGGAC CGGCTCCAGG CATCAGCAAA
GCCTACTGGG GACCACAAAT CAAGTTCGGC GCAATTCAGC CCGCCCTCAA CATCGACATG
GATGCATACA CCAACGTCGA AAATCTCACC TTCAATTTCG ATCAGCAGCA GAACCGGATT
CCGATCGTCT ACATTTACAA CCAGCAAACC GGCGTTTCTA TTCCGATTCC AATTCCGCCG
ATCACGCCCC TAAATCCGCC ACTCGGACTG ATTCCGCCAC TGCCGTCGAA CATCCCACCC
GATCTCACGC CGATCCGCGA CGACCTTTCG AAGCGCCCAA TCCCCCAGAC CATCATGATC
GGCCTAGCTG CAGCGTCGCA ATGGGCAGAT GCAGTTACCG GTGAAGGCAC CCTCGATGTT
GTGCGTTACG GCGGAGTTCT CAAAGCCCGC GAGCTCGTAG GCGTGCGAGG CGCGGGACCT
GCCTTCGACG GTCTTTATTA CGTAAAGAGT GTCACCCACA AAATCAAGCG TGGCGAATAC
AAGCAGAGTT TCAAGCTGAG CCGTAACGGC TTGGTATCCA CAGTTTCCAC GGTGCCCTCA
TGA
 
Protein sequence
MLDSVQLNLM MGPFFPMTPP RPVMDALDSV EVTVNDTGTS GFQLTFLIDK QSPLNIMFLL 
TGGLPLLFMR VVIVAIVNGV SNVLIDGVIT NNHISPGDKG SNSTLTLTGE DLTALMNQSD
WSGFPFPACP AEARVALICA KYAIFGVIPL IIPSVLIDVP LPIDMIPSQQ GTDLAYVRAL
ADRVGYVFYI DPGPAPGISK AYWGPQIKFG AIQPALNIDM DAYTNVENLT FNFDQQQNRI
PIVYIYNQQT GVSIPIPIPP ITPLNPPLGL IPPLPSNIPP DLTPIRDDLS KRPIPQTIMI
GLAAASQWAD AVTGEGTLDV VRYGGVLKAR ELVGVRGAGP AFDGLYYVKS VTHKIKRGEY
KQSFKLSRNG LVSTVSTVPS
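
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content can be recomputed from the same string. The following is a minimal sketch rather than the database's own method: the names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments translation table 11 shares (the two differ only in which codons may act as initiators, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):  # an incomplete trailing codon is ignored
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first two groups of the gene sequence give the first six residues.
print(translate("ATGCTCGATA GCGTTCAACT"))  # MLDSVQ
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would confirm that the two listings agree.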