Gene Acid345_2533 details

Gene Information

Locus tag: Acid345_2533
Symbol:
ID: 4072177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2989891
End bp: 2993022
Gene Length: 3132 bp
Protein Length: 1043 aa
Translation table: 11
GC content: 60%
IMG OID: 637984550
Product: hypothetical protein
Protein accession: YP_591608
Protein GI: 94969560
COG category:
COG ID:
TIGRFAM ID:
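
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions about how this record is laid out. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2989891
end_bp = 2993022
reported_gene_length = 3132      # bp
reported_protein_length = 1043   # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)           # 3132 True
print(protein_length, protein_length == reported_protein_length)  # 1043 True
```

Under these assumptions the reported gene length and protein length both agree with the coordinates.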


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.693881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATTT GCTCGCGCTT CCTGGCGCTG GTTTTTGCGC TCCTGGTTAT CTCTTCTTCT 
CTTCATGCTC AGCAGTATTC CGAACAGAAT TTTGGCGCGA TGAAGTGGCG GCAGATTGGG
CCGTTCCGCG GCGGACGTGT GCTGGCCGTG ACGGGCGTGC CGGGGGATCC GGCGACGTTT
TACTTTGGCG CTGTCGCGGG CGGCGTGTGG AAGACCTCCG ATGCGGGCGG GACATGGAAG
CCGATTGCGG ATAAGGACGG GATTGTTTCC GTCGGGGCGA TTGCGGTTTC TGAGAGCGAT
CACAACGTGC TGTATGTGGG GACCGGCGAG GCTTGCATCC GAGGGAACAT TACTTACGGC
AACGGCGTGT ACAAATCCGT GGATGGCGGG CAGAACTGGC AGCACATTGG ACTGGAAGAC
ACGAGGCAGA TTGGGCGGCT AATTGTTGAT CCGAAGAACC CGAACCGCGC GTTTGTGGCG
GCGCTGGGAC ATGCGTTTGG GCCGAATGCG GAGCGCGGTG TGTTCCGCAC GATTGATGGC
GGTAAGACCT GGGAGAAAGT TCTCTACAAG GACGATCAGA CGGGCGCGAT CGATGTGCAG
TTCGATCCGA ACAACGCGAA TACCGTGTAC GCGGCGCTGT GGCAGGTGGT GCGCAAGCCC
TGGAACATGA GCAGTGGCGG GCCGGGGAGC GGGCTGTATA AGTCCACTGA TGGCGGCACG
ACTTGGAAGC GACTGGAAGG GCATGGGCTG CCGGAGGGGA TTTACGGACG GATTGGGATT
GCGGTGGCGG CGAACTCGAC GCGGGTGTTT GCGCTGATCG AGGCGAAAGA GGGCGGCATC
TTCCGCTCCG AAGATTCGGG CGCGACGTGG ACGCGCATCA ACGATGATGA GCGCTATCGG
CAGCGCGCGT GGTACTTCAC GCACATCTTT GCCGATCCCA AGAACATTGA CACCGTGTAT
GTGCTGAATA CCGGTGCGTT CAAATCTACG GACGGCGCGA AGACGTTTGA TCTTTTGCCG
GCGCCGCATG GCGATCATCA TGGGCTGTGG ATTGATCCGC AGAACAGCGA TCGGCTGATC
AACAGCAATG ATGGCGGGGC GACGATCTCG CTCGATGGCG GGAAGACGTG GTCGACGCAG
CAGAACCAGC CGACGGCGCA GTTCTACCAC ATTGTGGCGG ACAACCGGTA TCCGTACTAC
CTGTACGGCG CGCAGCAGGA CAACTCGACC GTCGGGATTG CAACGATGGA TGAGCAGGAA
GGCGTGATTG GGCGCTGGGA TTGGTACGCG GTGGGCGGCG GCGAGAGCGG CTATATTGCG
CCGGACCCGA ACAACGCAAA CATTGTGTAT GCGGGCGATG GCGGAGGCGT GGTGACGCGC
TACGACCGCT CGCGCGAGAC CATCCAGGAC ATTTCGCCGT TCCCACTGGA TACGTCGGGA
CAGGGCGCGG ACAAGCAGAA GATCCGCTTC CAGTGGACGG AGCCGATCAT TATCTCGCCG
CACGATCCGA ACACGATTTA CACGGCGGGC GATCGCATCT TCAAAACTAC CGATCGTGGA
CAGAGCTGGA AGGAGATTTC GCCAGACCTG ACGCGGAATG ATAAGTCGAA GCAGACGCCA
TCGGGTGGGC CAATCACGCT GGACATTACG ACTGTGGAGT ATTACGACAC GGTGTTCACG
GTGGCGGAGT CGCCGAAGCA GAAGGACTTG ATCTGGGCGG GCACAGACGA CGGACTGATG
AAGCTGACGC GCGATGGCGG CGCGCACTGG GAAGACATTA CGCCGAAAGC GATGCCGGAG
TGGAGCACGG TGAGCCTGGT GGAGGCGTCA CCGTTTGATG CGGGGACGGC GTATATCGCG
GTGGATCGGC ATAAGCTGGA TGACATTAAG CCGTACATCT ACAAGACGCA CGATTTCGGC
AAGACCTGGA CGGCGATCAC TGCAGGTATT CCGGAGAATG CTTACGTGCA TGCCGTGCGT
GAGGACACGG TGCGCAAGGG ACTGCTCTTC GCCGGAACGG AGAAGGGCGT TTATGTGTCG
TTCAACGATG GGGCGAATTG GGAGCGGCTG CAACTGAATC TGCCGGTGGT GCCGATTCAT
GACCTCGTGA TTCATGCGAA TGATTTGTCG GTGGCGACGC ATGGGCGGTC GTTCTGGGTG
CTGGATGACA TCACGCCTCT GCGGGAGTTG GACGGTGGGA ATGCGGAGGC GGTGCTGTAC
AGGCCGCGGG AAAGCCATCG AGTGCACTAT CCGGATGGAG TGGATCGGCG GCGACCTGTG
GGGGACAATC CGCCGAACGG AGCGACCTTC TACTATTACT TGAAGGATGC GCCGAAAACT
GAGGCTACGC TGGAGATTCT GGATTCGAGC GGCAAGCTGG TAAAGAAGTT CTCAGACCGC
GAAAAGAAGG CCGCCAACGA GCAGCCGCAA GAGTGGCCGG ACCTGGAAGC GCCTCCGAAC
TTGATTCCGG CGAAGGCTGG GTTGAATCGT TTTGCGTGGA ATTTGCGGTG GGAGGATCCG
ACGCAGACGC CGAGCGCGGT GTACGAAGGC CTGCCGCCGC AAGGACCTGT GGCAGCGCCG
GGGAAGTACA CGATCCGGCT CACAGTGGAC GGGCGGAAGT CTGAACAGCC GTGGGAACTG
AAGGCGGATC CGCGGGATTC TGCGGACGTC GCGCAGGGGA TCGAACAGCA GGTGGCGTTC
GAACTTGAAG TGCGCGAGCG CATCACGAAA CTGCATACGG CGGTGAACCA GATCCGCGAC
CTGCGGGAGA AGCTGGAGAC ACTGAAGAAG TGGGTGGGCG AAAATCCGCA AGGGAAGCAG
TTGCTGGAAC AGGCGGAGGC GCTCGATAAG AAGATGTCGG GCGTGGAGGA ACAGCTGATC
CAGGTGAAGC TGAAGAGCAC GGAAGGGAAC CTGCGGTATC CGAACATGCT GAACGAGCAG
TGGGCCACGT TTGCCGCGTT CATTGATATT GCGGATGCGC CGCCGACTAC GCAGGAGAAG
TCGGTGTACG AGTATCTGTC TCAGCAATCG GATGCAAATA TCGCCAGGTG GGAAGAGATT
CGGAAGACGG ATGTGCCTGC TTTGAATGAG GCGATGCAGA AGAGCGGGGC GGTTAGGCTG
GGGGTGGAAT AG
 
Protein sequence
MRICSRFLAL VFALLVISSS LHAQQYSEQN FGAMKWRQIG PFRGGRVLAV TGVPGDPATF 
YFGAVAGGVW KTSDAGGTWK PIADKDGIVS VGAIAVSESD HNVLYVGTGE ACIRGNITYG
NGVYKSVDGG QNWQHIGLED TRQIGRLIVD PKNPNRAFVA ALGHAFGPNA ERGVFRTIDG
GKTWEKVLYK DDQTGAIDVQ FDPNNANTVY AALWQVVRKP WNMSSGGPGS GLYKSTDGGT
TWKRLEGHGL PEGIYGRIGI AVAANSTRVF ALIEAKEGGI FRSEDSGATW TRINDDERYR
QRAWYFTHIF ADPKNIDTVY VLNTGAFKST DGAKTFDLLP APHGDHHGLW IDPQNSDRLI
NSNDGGATIS LDGGKTWSTQ QNQPTAQFYH IVADNRYPYY LYGAQQDNST VGIATMDEQE
GVIGRWDWYA VGGGESGYIA PDPNNANIVY AGDGGGVVTR YDRSRETIQD ISPFPLDTSG
QGADKQKIRF QWTEPIIISP HDPNTIYTAG DRIFKTTDRG QSWKEISPDL TRNDKSKQTP
SGGPITLDIT TVEYYDTVFT VAESPKQKDL IWAGTDDGLM KLTRDGGAHW EDITPKAMPE
WSTVSLVEAS PFDAGTAYIA VDRHKLDDIK PYIYKTHDFG KTWTAITAGI PENAYVHAVR
EDTVRKGLLF AGTEKGVYVS FNDGANWERL QLNLPVVPIH DLVIHANDLS VATHGRSFWV
LDDITPLREL DGGNAEAVLY RPRESHRVHY PDGVDRRRPV GDNPPNGATF YYYLKDAPKT
EATLEILDSS GKLVKKFSDR EKKAANEQPQ EWPDLEAPPN LIPAKAGLNR FAWNLRWEDP
TQTPSAVYEG LPPQGPVAAP GKYTIRLTVD GRKSEQPWEL KADPRDSADV AQGIEQQVAF
ELEVRERITK LHTAVNQIRD LREKLETLKK WVGENPQGKQ LLEQAEALDK KMSGVEEQLI
QVKLKSTEGN LRYPNMLNEQ WATFAAFIDI ADAPPTTQEK SVYEYLSQQS DANIARWEEI
RKTDVPALNE AMQKSGAVRL GVE
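
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the permitted start codons. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table laid out in TCAG order; the assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAGAATTT GCTCGCGCTT CCTGGCGCTG GTTTTTGCGC TCCTGGTTAT CTCTTCTTCT"
last_line = "GGGGTGGAAT AG"

print(translate(first_line))  # MRICSRFLALVFALLVISSS
print(translate(last_line))   # GVE
```

The two outputs match the first 20 residues and the last 3 residues of the protein sequence listed above. The final codon, TAG, is a stop codon and contributes no residue.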