Gene Acid345_2309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2309 
Symbol 
ID4071463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2734125 
End bp2736251 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content58% 
IMG OID637984325 
Productcarbohydrate binding protein 
Protein accessionYP_591384 
Protein GI94969336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.427947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAACGAT TCAATACTGC CGTTGTTCTC CTATCCCTGT CCTTGGCAAC AACTACTGCT 
TTCGCGCAAC CTGCTGTCGG CACTATTCGC GTGGATACCA ATCCTCAACA CATTTTGAAC
TCTTTCGATC CCGACCAGGC CCTCGGCAGC TCCGTGGACG TCCTTTCGCA CTACGACATT
GAGAAGGTGT ACTCGCCGCA CATTCTCCAG GAATCGCTCT CCGCTGGATG GGGGCCGATC
ACCTATCGAA ATAACAGCGA GTTGCGCATG GCCTATTTGC ACTGGAACGC TGAAGGGACC
TGGAGCGACC CAACACATCG CAGCGGATAC TTCATCGGCA GCACCGAGCT CAAAAAGCCA
ATTCGCTACA GTCTTTCCTA TGGTCTTCCC CATCGAGGCT TTGCCACCAG CGGGGATCGC
CCAATCGAAA GCCCGAACCT TACCTACTGG AAAAGCAACC CCTATTTGAC CAGCAAATTC
ACCGGCGAGA GCGACGCGCT CCATCCGCAA TGGGTCGTGG TGGATTTGCA ATCCGAAAAA
TCGGTGAACG CCGTTCGCAT CCAGTGGGCC GAACCCTACG CAGCTACCTA CACCGTCGAT
TACTGGGTTG GCAAAGAACC TCTGGAGTGG GACGGTGGCC CCAAAGGCGA GTGGAAAACC
TTCGCATCGG GCGAACTGAA GCACGCAAAA GGTGGCACCG CGACGTTGAA ACTCGCCGCT
GCCCCAGTCT CCACCCGTTA CGTCCGAGTT TGGATGACCG AGTCTTCCAA CACCTGCGAC
GAGCACGGCG CGGAAGACTC GCGCAACTGC GCCGGATATG CCATCCAATC GATTCAAGCC
GGCAGCCTCG ATAAGCCCGG CCATTTCGTC GAAGCACCTC GTGCGGCTTC CGACAAAGAG
ATCACTTATA CCGCTTCCTC CATCGATCCC TGGCACTCCT CCGAAGACGT AAAGGACGAC
GGCTTTTACC AGCACACCGG CTTTGACCTG TTCTTCACCA GTGGGATCAC CAACGGCTTG
CCCGCGACGA TTCCTGTCAC CATGCTCTAC GGCACGCCGG AAGACGCCGC GGCACAAATC
GCTTATATCC AGAAACGAGG CTATTCCATC GGCTACGTCG AACTCGGCGA AGAGCCTGAC
GGCAAGCACG CCATGCCGGA GGACTATGCG GCCCTTTACA TCCAGTGGGC GAAGGCCCTG
CACCAGGTTG ACTCGAAGTT AAAGCTCGGC GGCCCGATCT TCGAGGGCGT CAACAAGGAC
ATCACTCTAT GGCCCGACGC CAAGGGCCGC ATCTCTTGGA TGGGGCGCTT CGTCGCTTAC
CTCAAAGACC ACGGCCATCT CGACGATCTC GCCTTCGTAT CGTTCGAACA CTATCCGTTC
GAGCCCTGCG AGATCACTTG GAAGTCGCTC TACGAAGAGC CGCAGTTGAT GAAAGGAATT
CTGAAAACCT GGCGTGATGA CGGCGTGCCG GCCAACGTTC CGCTCATGGT GACCGAGAAC
CATCTCGCCG CGCAGCTTAC CGGACCCATG ACGACGATCT TCGCAGCACT CTGGCTCTCC
GACAACGTGG GCTCTTTCTT CGAAGGCGGC GGAGCGGTCT TCCATCACTC GCCTATCCAG
CCGCAGGACA TCCATGAAAC CTGTCTCGGC TGGGCGTCGT GGTCGAACTT CGTCGCCGAC
GAGCGCTACA ACATCAAGGG CTATACCTCG CCCTGGTTTG CCGCACGCAT GATCAATATC
GAGTGGGTGC AGCATCGCTC CGGCGTGCAC CAGATGTTCC CCTCGTCTTC CGACATTAAG
GATGCGCAAG GTAACACTCT CGTCACCACA TACGCCGTGC ATCGTCCCGA CGGCAACTGG
TCGCTCATGC TCGTCAATCG CGACGAGAAC ACCCCGCACG ATGTGAAGAT CAACTTCGAC
GGCGATGCCG GCTCGCATTT TGAAGGACCG ACCACGTTCG TCACCTTCGG CAGCGAACAA
TATGTCTGGA TCAACGACGG CCCGAATAGC CACGCCAATC CTGACGGCCC GGCGGTTACG
CGCATCCTTG CTGCCGACGG ACAATCCGTC TTCACACTGC CGAAAGCTTC GGTCACGGTG
ATTCGCGGCC ACATCGCCGC GAAGTAA
 
Protein sequence
MKRFNTAVVL LSLSLATTTA FAQPAVGTIR VDTNPQHILN SFDPDQALGS SVDVLSHYDI 
EKVYSPHILQ ESLSAGWGPI TYRNNSELRM AYLHWNAEGT WSDPTHRSGY FIGSTELKKP
IRYSLSYGLP HRGFATSGDR PIESPNLTYW KSNPYLTSKF TGESDALHPQ WVVVDLQSEK
SVNAVRIQWA EPYAATYTVD YWVGKEPLEW DGGPKGEWKT FASGELKHAK GGTATLKLAA
APVSTRYVRV WMTESSNTCD EHGAEDSRNC AGYAIQSIQA GSLDKPGHFV EAPRAASDKE
ITYTASSIDP WHSSEDVKDD GFYQHTGFDL FFTSGITNGL PATIPVTMLY GTPEDAAAQI
AYIQKRGYSI GYVELGEEPD GKHAMPEDYA ALYIQWAKAL HQVDSKLKLG GPIFEGVNKD
ITLWPDAKGR ISWMGRFVAY LKDHGHLDDL AFVSFEHYPF EPCEITWKSL YEEPQLMKGI
LKTWRDDGVP ANVPLMVTEN HLAAQLTGPM TTIFAALWLS DNVGSFFEGG GAVFHHSPIQ
PQDIHETCLG WASWSNFVAD ERYNIKGYTS PWFAARMINI EWVQHRSGVH QMFPSSSDIK
DAQGNTLVTT YAVHRPDGNW SLMLVNRDEN TPHDVKINFD GDAGSHFEGP TTFVTFGSEQ
YVWINDGPNS HANPDGPAVT RILAADGQSV FTLPKASVTV IRGHIAAK