Gene Acid345_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1806 
Symbol 
ID4070493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2184455 
End bp2186461 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content63% 
IMG OID637983815 
Producthypothetical protein 
Protein accessionYP_590881 
Protein GI94968833 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCCGG GTTTCGAGAG TGAGATCAGC AAGGGCCCTG GTGGTTTGAC GCCATCAGGG 
CATACTTCTA TCCGAATGAG CGTACAGCAA GATCTCCGCT CCCACCGCCG CTTGTTTCTG
CGGATGAGCG CTTGCCTGCT CTGCACCACA TCTCTTTTCG CTGCCGCTGA TTACACCCGC
GACACGGTTC CCCACCACCC GAAGAAAGCT CAGCCCGCGC GTCCCGCGGC AACGGCCCGT
GTGGCTCTCT TTGAACGGGA CGACCTTCCC CGCGCCACCA AGCTCGCCCA GTTGGAACTC
GCTCGGGCGC CGCACACCAT TGAGCCATGG TTTCTCGCCA TGGAGATCGC TCATGCCAGT
GCCGACGCCC GGACTGAGCT CAAAAGCGCG ACTGAAATCT GTCGTCTTGC TAAAGATGCG
GACCCACGTG GCCTGATCGC TGCCATGCGT CTCGAGGCGA CCGGCCAGAA CAGCGAGGTC
TTCCGCGACC ATCGATCCCA GATCGAAGCC GTCGCCGCAA AAGACTCGGT TTGTTCCGCT
GCCGCGATCG AAGCCCTCTA CACCGCCGGA TTGCAAGGCC TCCCCAACAC TGACTTACGC
GCCCTCGCGC TCGCCTCCGG CTGGATCACT AACTGGAGCA TCGCCCGCGT CAACGGTGGC
TCCGCTTCCG AACGCTTCGA GTTCCCCGAC GCCCGTGTTC GCGTCCCCGA TTACCTGCCG
CGCAACGCCA AGTACGTCGC GCAGGCCGAA TACCTAGCCA CCGAATCTGG CACATTCACC
ATCACCGGCG ATCTCAATGC CACGCAAATC ACGATGGATG ATAAAGGCGT CACCGGCGCC
GTCGCTCTCG AATCTGGCGT CCACCAATTG CGGATCACTT TCCGCCCCGG CGACGTCACG
CCGCGCATTC GCATCGCGAA ATCGCTGACG TTCGATCCTG CAACCGTCGC GACGCTCCAT
CTTCCGCCCC ACGAGTCTGC CTACATCGAA GCCGCAGCGT TGGCCGCTAC CGGCGATCTC
GCTTCTGCCG CCGCCAAGCT TAATACAGCG GATCTCGCCG CCACCCAGAT TGGCCACCTA
CTTCTGGCCA CACGTAAACC AGCGGCTCCG TCCGACCCTC TCGACCACGC GCGTTGGCTC
GCCGGCAACA ACCGCTACCT TGAAGCCCTC GATGAGTTGA ACGAATATGT CCGCGAGTGG
CCGCTTGATC GCGATGCCCG TCGTCTGTTG ATCTCCGAAT TGCAGCGCCT CGGCAACAAC
ACTTCCGCCG ACCGCGCTGC CGCCGAATTT CTCGCGGTCG CACCCAACGC CCGCAACTAC
CGTCGCATGG CGCAGAACGC AACCTCCGTC GACGGCCTTG TGCCTGATCC GCCGTTCTTC
GCCGCTTACC GGCGTCCGGC GCCCGCACCG CTGCAGACCG CCGGCGCGCC CGCGGTCGTC
CTGTTGCAAG ACAAAGTCGC GATCTCGCGT CCCGACGGCA GCGTCTCGCT CTACATGCAT
CGCGTCGTCC AGTTGATGAA CGACGAAGGC ATCCAGACCT TCCAAACGCT CCCGTTGCCC
GAGGGCGCAC AACTCCTGAC CTCACGCATC GTCAACGGCG ACGCCTCTCT CGCCTCTTCT
CCGCAACCTG GCGACGAGAT CGAAGAAGAG TACGTCGTCA ATTACATTGG CGACGGCGGC
ATGGCCGCGC ATCCCGAGGC CTTCCAGTTC GTCTTCAACA CCTTCGACTG CCCATTGCTC
GACGCCCGTT TCGTGGTCCT CTCGCCCTCG GTCGAAACCC CCGGCTACGT CATCGCCAGT
GGCGAAACCC CCGCCTCGCG AACCGAGTTC CGCGACGGCC TCCGCGCCCA GATCTGGGAG
AAATCCACCA CGCCCGATAC GATCATCGCC GACCCCGCCA TCATCCGTGT CGTCGAAAAC
GAAAACGGCT GGTCCACCCC ACCTTCGGTC GAGCGCAAAC GCAACCTCCT GACCATCCAC
CCCGGCCCCC GCAACCGCGA AGCGTAA
 
Protein sequence
MQPGFESEIS KGPGGLTPSG HTSIRMSVQQ DLRSHRRLFL RMSACLLCTT SLFAAADYTR 
DTVPHHPKKA QPARPAATAR VALFERDDLP RATKLAQLEL ARAPHTIEPW FLAMEIAHAS
ADARTELKSA TEICRLAKDA DPRGLIAAMR LEATGQNSEV FRDHRSQIEA VAAKDSVCSA
AAIEALYTAG LQGLPNTDLR ALALASGWIT NWSIARVNGG SASERFEFPD ARVRVPDYLP
RNAKYVAQAE YLATESGTFT ITGDLNATQI TMDDKGVTGA VALESGVHQL RITFRPGDVT
PRIRIAKSLT FDPATVATLH LPPHESAYIE AAALAATGDL ASAAAKLNTA DLAATQIGHL
LLATRKPAAP SDPLDHARWL AGNNRYLEAL DELNEYVREW PLDRDARRLL ISELQRLGNN
TSADRAAAEF LAVAPNARNY RRMAQNATSV DGLVPDPPFF AAYRRPAPAP LQTAGAPAVV
LLQDKVAISR PDGSVSLYMH RVVQLMNDEG IQTFQTLPLP EGAQLLTSRI VNGDASLASS
PQPGDEIEEE YVVNYIGDGG MAAHPEAFQF VFNTFDCPLL DARFVVLSPS VETPGYVIAS
GETPASRTEF RDGLRAQIWE KSTTPDTIIA DPAIIRVVEN ENGWSTPPSV ERKRNLLTIH
PGPRNREA