Gene Acid345_0857 details


Gene Information

Locus tag: Acid345_0857
Symbol:
ID: 4068951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1066208
End bp: 1069390
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 58%
IMG OID: 637982866
Product: hypothetical protein
Protein accession: YP_589936
Protein GI: 94967888
COG category:
COG ID:
TIGRFAM ID:
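
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon,
    which does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1066208, 1069390)
print(length)                  # 3183
print(protein_length(length))  # 1060
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.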


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA CCGCCTTGTC ACACTCCAAC TCGCAGCAGG AGAACACTTC CTCGAATTCG 
TTGCGCGTCG GCGACCCCAA TAGCTCATTC GAACGGGAGG CGGACCGCTT CGCGGATCAA
GTCATGACTG GGGAGTCTCA CAAGAGTCCA TGGTCGCTAT CTCGCGTCAG CCTCGGACAA
ACCCTACAGC GCAAATGTGG TTGTAAAGGG TCGAGCGATT CGAAGGACGA ATGCGATGAC
TGCAAGAAAA AGAAGCTTTT GCAGCGCAAG CCGGCAGCAC TCGGAGCCCC ACATGTCGCT
CCGCCCATTG TGCATAGCGT TCTGAATTCG AGCGGCCGAT CTCTCGATCA TGCAACAAGA
TCGTTCTTCG AGCCGAGATT AGGCGTCGAC CTCGGAAGCG TGCAAGTACA CGACGATGGT
CGTGCAGCGA GGTCCGCAGA GGCGGTCAAT GCCCACGCCT ACACCGTCGG CAATTCAATT
GTGTTTGGCG AAGGCCGCTA CCAGCCTGAA AGCGCTGAGG GCCGTCGCCT GCTGGCCCAC
GAGTTGACCC ACGTCGTTCA CCAACAGGGT CCAAGCGAAC GTCTGGTCCA GCGGGAAGAG
GATTCGGATG TTGAAGTCGC TGACGCGCCT CGAACCGGTT CGCTGATCGA AAAGGCTTTT
GATGCCGCCG ACGCAGCGCA CTGGGAACAA GCTGCCGAGT TGGCAAATGG ACTAAGTGCC
GGCGATCTAA AAGCGTTCAT CGGCTCCTTG GGCGCGGGCT GGAAGACCGA GCAACTCCAC
ATCGGCGCCA TCAGCAACGC TCGCGTTGGG CCTGGCTCTG CGGTCGCGAA GATGACGCAT
TGGGCATTCC TCAACGCCAA ATTTTCGGAA CAGATGAAGG GCGGGTTCTA CCAGGCAGCG
TCGGAATATC TCAACGGATT TAGTCAGGGC GAGATCCGCT CCCGCATCAG CAAAATGAAA
ACTGAGGTTG TCGCCGGCCT GCACAGCGGC GCCGTCGCTC AGCCCGGGAT TGGCGCAGAT
TCGAACGCCG CCAAGATCAC TGGCGAGGAA CTCGATAAGC GTAAAGAAAA AGGCGATGCC
GCCGCCGATG CCGCAACCAA GGCAGCCATT CCCGAGAATC CTCAACAGAA AAAGAAGCGT
TGCCAGGACA CCGCCGGCCA AGGCTTCAAG ATCTTCCCTC TCCGCTTGCC CAAAGGTATG
TGGCAGCTTT CAAACGCGCC CATCGGCGCG GAACGCAAAG GCGACGAGAT CCTCGTCAAG
CAGCCTCTGA ACGACGTCAA AGGCGATCCG ATGTTCCGCC GCGAAACGAA AACTCTGCCG
CTCGAAACCT TCCTCGGGGG CATTCGCTTG AAGAAGGACG AAGTCGTTGG CGTGCGCCTC
TATGACGACA AAGAACGCCT CGTCTGCGTA ACCGGCGAAG ACATGCTCAA GTTCAACGAC
GCGACCGAAA TGGCGCTCTG GTTCAGCGTC GGCCGCACCG CTTTGGATGC CGCAACCATC
TTCGCGCCGG GTGCGAGCGC CGGTGCAAGC AAGGTAACCG GCTTCGCGGT TGGCAACATC
GTCGCGGGTG AACTCCTCGA TGTCGGCCGT CAGGAAATGG AAGTCAAGTA CGGACTGCGC
GAAGAAGTCG ACTGGGGTGG CATCGCCTTT GACACTGTCT TCCAACTCGC GACCCTGGGC
TTCTCAAAAT ATCTGAATAA TGCTGCCACG AAAGCTGTGC TCGGAAAGGC TCCGGAACTC
GGGCAGAAAC CAGCGCAACT TGCCGTCCAT GCCGCTCTCG CCGGCGCCAC CAACTTGGTG
CAGACCGCCG CGAGAACCGC GTTCGACATG CTGCGAAAAG AGAAAAAGAA ATTCGTGATG
GAAGATTTCC TGATCGAACT CGCGCAGGCC TTCGCTACAG GAACACTTTT TGCATTCGTA
CATGGCGCTG CCGTTCACGA AGAAGGATTG CCGCAAGAGA AGCAAGCTGC CCCATCCGAG
CATCAGCAGG GCGCACCACC TGTACACGAT CAAGCGGCTC CACCACTACA CGATCAAGCG
GCCACGCCAG TGCACGATCA AACCGCGGCT CCAGTCGCTA AGCCGCAGAA GAAAACGCCC
CCCGTTCATA CGGACGAGCA CGTGACGACA GCGCCTCCCG ACAAGGCACC TGGCGAGCGC
AAAGGCGCTG CGCCTCTTCA CGAAGACACA CCACCGGCCA CCACGCAGAA ACGCGCAATC
GGTGCGCCTC CAGAAGAGCA CGCCGGAACG CCTAGCGCGA AGAAGACGGA CGGACCGGAA
GGCCAAACTG CCGCGGCCGT ACAAGAGAAG GACGCAACCG CCAAGAAGAA GACTGCGGAT
GGCAAACACG ATGTCGTCGT TACCGAGCAG GGTGTCGGGA AGTGTAGTCC TCCGCCTTGC
CCTGTGATTC ACGTGGAATA CAAGAAGGAA CTTGATGCCC ATCCTGAGTT CAAGGAGTGG
AACGAATCGG TCCAGAACAT GCGTAAGGCC GATCCCGAGT TTGCCGCCGA ACAGGGCAAG
AAGCTCATCG CCGCGTTGGA AGACGTACGT GCAAATGGAG GGAAGCTCTC TGGCGAAAAA
CTAGTTCAGC ATCGCGAAGC CGCTTTGCAA GCGCGCCTCG CAGAAGCCGA AGGAGATCTG
CACAAGGCAC GTTGGGACAC CATCGACTAT CAAGCCGAGC GAGCCGCGAC TGGCAAGAGC
AGGAAGGGCG GGCCGATCAA GGGACTCTGG AATGTCAAAG AACGGATATG GGCAATCAAG
AGGCAGATGG CCTATCCGAA TCGCACGATT CTCGAACAAG CGCACATTGT CGGTGTGCGT
GCTCCTGACG GAACAATCAA GCCCACAAAC GAAATCGGCA AAGGTGGACG CATCCCGGAT
TACGTAGAGG TACGCGGCCA GAAGATTGTG GCGGGCGACC TCAAGTCCGG AGAGGAATTC
AAGAAAAGCA TCGCCGGCGG ACTGGCGAAA CCAGGCGAAA TCGAAGCGGA GTTCCGCAAG
AGCGCGAAGA TCGCGCAGCA ACAAGGCGTA GAAGACAAGG TCCTGAATGC AGCAAAAGGA
AATGGCGGAA AGATTGTGAT CGAGGGCTTT GACGTAACAA CTGGCGAGAA AGTCGTGAAA
GAAGTGGATC CGGCCGATTA CGGGTCGGAA GTAATTACCT ACGACGACGT TCGCACCAAC
TAG
 
Protein sequence
MSRTALSHSN SQQENTSSNS LRVGDPNSSF EREADRFADQ VMTGESHKSP WSLSRVSLGQ 
TLQRKCGCKG SSDSKDECDD CKKKKLLQRK PAALGAPHVA PPIVHSVLNS SGRSLDHATR
SFFEPRLGVD LGSVQVHDDG RAARSAEAVN AHAYTVGNSI VFGEGRYQPE SAEGRRLLAH
ELTHVVHQQG PSERLVQREE DSDVEVADAP RTGSLIEKAF DAADAAHWEQ AAELANGLSA
GDLKAFIGSL GAGWKTEQLH IGAISNARVG PGSAVAKMTH WAFLNAKFSE QMKGGFYQAA
SEYLNGFSQG EIRSRISKMK TEVVAGLHSG AVAQPGIGAD SNAAKITGEE LDKRKEKGDA
AADAATKAAI PENPQQKKKR CQDTAGQGFK IFPLRLPKGM WQLSNAPIGA ERKGDEILVK
QPLNDVKGDP MFRRETKTLP LETFLGGIRL KKDEVVGVRL YDDKERLVCV TGEDMLKFND
ATEMALWFSV GRTALDAATI FAPGASAGAS KVTGFAVGNI VAGELLDVGR QEMEVKYGLR
EEVDWGGIAF DTVFQLATLG FSKYLNNAAT KAVLGKAPEL GQKPAQLAVH AALAGATNLV
QTAARTAFDM LRKEKKKFVM EDFLIELAQA FATGTLFAFV HGAAVHEEGL PQEKQAAPSE
HQQGAPPVHD QAAPPLHDQA ATPVHDQTAA PVAKPQKKTP PVHTDEHVTT APPDKAPGER
KGAAPLHEDT PPATTQKRAI GAPPEEHAGT PSAKKTDGPE GQTAAAVQEK DATAKKKTAD
GKHDVVVTEQ GVGKCSPPPC PVIHVEYKKE LDAHPEFKEW NESVQNMRKA DPEFAAEQGK
KLIAALEDVR ANGGKLSGEK LVQHREAALQ ARLAEAEGDL HKARWDTIDY QAERAATGKS
RKGGPIKGLW NVKERIWAIK RQMAYPNRTI LEQAHIVGVR APDGTIKPTN EIGKGGRIPD
YVEVRGQKIV AGDLKSGEEF KKSIAGGLAK PGEIEAEFRK SAKIAQQQGV EDKVLNAAKG
NGGKIVIEGF DVTTGEKVVK EVDPADYGSE VITYDDVRTN
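
The sequences above are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The sketch below is illustrative and not part of the original record. It shows one way to clean the gene sequence, compute its GC fraction, and translate it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons; table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCGCA CCGCCTTGTC ACACTCCAAC"))  # MSRTALSHSN
```

Applied to the first 30 bases of the gene sequence, this reproduces the first 10 residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.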