Gene Acid345_0886 details


Gene Information

Locus tag: Acid345_0886
Symbol:
ID: 4069136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1103623
End bp: 1104861
Gene length: 1239 bp
Protein length: 412 aa
Translation table: 11
GC content: 56%
IMG OID: 637982893
Product: hypothetical protein
Protein accession: YP_589963
Protein GI: 94967915
COG category:
COG ID:
TIGRFAM ID:
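
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The constant and function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1103623
END_BP = 1104861
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(length_ok, protein_ok)  # prints: True True
```

Under those assumptions, 1104861 - 1103623 + 1 = 1239 bp, and 1239 / 3 - 1 = 412 aa, which agrees with the listed values.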


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.500589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTTC GTCCACTCAA TTCGCTCCCA GCCGTGATAC TCGCGGCGTT CATGCCGGTT 
CGATCGCCGG AGTTGGAGCA GGCATTTGCG AAGTTTGCGC TTCGTGATTG GCGGAAGCTT
GGAAACTGGA TCCATGCGAG TGGACTTGCG CTGTACTTAT TGGACGAACT GAAATCGCGA
GGCATCGGAG ATGCGTTGCC CGATGCGATT CGATCGGAGC TTGAAGAAAA CCTGGCGGAC
AATCTTGAGC GGACCAGCGC GCTCATTGTC GAGTTCTTGC GGCTGAATTC GGAGTTTGCC
GCGGTGAAGC TCGATTACCT CTGTATCAAG GGCTTCACGC TTGGCGCTGT CTACTGCAAG
AGGCAGCAGC TACGGTCACA GTTCGACCTG GACTTTTGGG TCAGAGAAGA TCAGGCACGA
GAGTTTACGA AGCTGATGCG TCGTCTCGGG TACGACGTGA AGGCATTCGA TCGCGTGCTC
GAGTGCCACA CGAACGGTAC TCCGTATCCG AGGTTCCAAG ACTTTTACAA GCCTCCTCAG
CGCAAATCGG TAGAAATCCA CCTGCGCTCT GTGAGCGAGT TCGATCAAGT TCCTCGCGCA
AACGGTGTGC TCAACAACGT AATTTTCCCG GCATTGGCGC GCGAGCGGAT GTTCGTTGAA
CAGGCGTTGC ACCTCACAAA ACATTTTTGC AGCGAGTGGA CCCGCGCCTC CTGGGCATTG
GAGTTGAACC GAGCAATTCA CGGCCCCGAG ATCGGTGCAG AGTTCTGGTC GGCCGTGAGG
GAGCAGTGCA CAGAGGTTCA AGCGATCCTT CTGGGTATTG CGGTCGCTTC GTGCGCCCAT
ATCTGGAAAT GTTCCGCGCC GGTCGAACTC AATTGGGCCA CCAAGGCTTT GCCAAGTGGC
GTAGTGCGAT GGATCGATGA GTATGCGCAA TCAGCGGTGA CTGCGCGTTT TCCGGGATCG
AAGTACTACC TTCTTCTTGA GAAGGAGTTG GAAAAGGACT CGATCGATTA TCGACGCCAT
CGTCGAGCAG CGCTGCTGCC TTTTAGGATG CCCGGATACG TCACCAACGA ACGACGTCTG
ACACTCACCG AGATTCCCTC CCACATCAAG TATGTCGGAA AGCGTTTCGC ATTCCACGTG
CGAGAGGGAG CAAAGTTGCT TCAGGCGGAA CGTAATTGGT TGGAGCGAAC GCGAAGCGCG
AAAGTATACG CGCAGCGTGA AGGTGACAGC ATCGCGTAG
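
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the fraction of G and C bases over the full sequence; how the record rounds its 56% figure is not stated. The function name `gc_fraction` is hypothetical, and only the first line of the sequence is used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First line of the gene sequence above; paste the full sequence to
# compare against the GC content listed in the record.
first_line = (
    "ATGCTCGTTC GTCCACTCAA TTCGCTCCCA GCCGTGATAC TCGCGGCGTT CATGCCGGTT"
)
print(f"{gc_fraction(first_line):.1%}")
```

Stripping whitespace first means the sequence can be pasted exactly as displayed, with its 10-base blocks and line breaks.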
 
Protein sequence
MLVRPLNSLP AVILAAFMPV RSPELEQAFA KFALRDWRKL GNWIHASGLA LYLLDELKSR 
GIGDALPDAI RSELEENLAD NLERTSALIV EFLRLNSEFA AVKLDYLCIK GFTLGAVYCK
RQQLRSQFDL DFWVREDQAR EFTKLMRRLG YDVKAFDRVL ECHTNGTPYP RFQDFYKPPQ
RKSVEIHLRS VSEFDQVPRA NGVLNNVIFP ALARERMFVE QALHLTKHFC SEWTRASWAL
ELNRAIHGPE IGAEFWSAVR EQCTEVQAIL LGIAVASCAH IWKCSAPVEL NWATKALPSG
VVRWIDEYAQ SAVTARFPGS KYYLLLEKEL EKDSIDYRRH RRAALLPFRM PGYVTNERRL
TLTEIPSHIK YVGKRFAFHV REGAKLLQAE RNWLERTRSA KVYAQREGDS IA