Gene Acid345_0650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0650 
Symbol 
ID4069742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp800618 
End bp803104 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content57% 
IMG OID637982656 
ProductAlpha-glucosidase 
Protein accessionYP_589729 
Protein GI94967681 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.195512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATACG AATACCGGCC GCGTACCTGC CGAACGTTTA TCTTTCGCGT GGCCTGCTTC 
GTGTTGCTGG CGCTGAGTGC AACGTTTGCG GATTGGCAAT CCATTGGAGA TCTCAAGCCC
AGGGGCAGAC AGGCAAATGA ATTCACCTTT GTCAACGCGC GGACAACGGT GGCAATTACC
GTGCTCGCTC CGGACATCGT GCGCGTGCGG GCCGTGGCTG GAACCTCTCT TCCCCCCGAT
CATTCGTACG CTGTAGTAAA GACCGACTGG CCCGCGGTCA AGGTGGACTT CTCTTCGAAA
GAGAACGTTG AAAGCATCCG CACCGAACAA TTGGAAGTAC GAGTTCAGCT GGCGCCGTTC
CGGCTTGCGT TCTACGACGC GAAAGGCAGA CTCATCTCGA AGGACGCCGA CGACCAGGGA
ATGAGTTGGG ATGGACATCG TGTCCGAGTA TGGAAAGACC AGCCAACGGA CGAGCACTAC
TTCGGGTTCG GAGAGAAGAG TACGCCTCTC GACAAGCGAG GCCGTTCGCT CGTGATGTGG
AACAAGGACC CTGAGGGATT CGATGGCACA ACGGAACCCC TGTATCAATC CGTGCCGTTC
TTCGTCGCGC TACGACAGGG TCGCGCGTAC GGCACCTTTC TGGATAACAC GTGGCGAAGC
TCGTTCGACA TGGGATCGGA AATCCCCGAC GTTTACTCGT TCGGCGCAGA GAACGGAGAA
CTGAATTATT ACTTCTTCGC GGGTCCGACG CCGAAGCAAA TCGTAAGCCG ATTCACCGAG
CTAGTCGGCC GCGTTCCGAT GCCGCCACGC TGGTCTCTGG GCTACATCCA AAGTCGTTAC
AGCTACTACC CGGAAACCAA AGTTCGTTTT ATCGCCGAGA ATTTCCGCGA ACGCGACATT
CCCTGCGACG GGATTTTCCT CGACATCGAC TTTATGGACG GTTTTCGCGT CTTCACCTGG
GACAAGTCGC GCTTCCCCGA TCCCAAACGC ATGATGACCG ACCTGCGGCA GCAGGGATTT
CACATCATCG CGATCGTGGA TCCGATGGTG AAAGTCGATC CCAACTATTG GGTTTACAAA
CAGGGGCTGG AAAACAATTA CTTCGTGAAG AAGCCCGATG GCACGGTCTT CACCGGCAAG
GGCTGGGGTG GTCAGAGCGC GTATCCGGAC TTCGCTTCAT CCAAGGTCCG TGACTGGTGG
GCGGGTCTTT ATAAGGAACA AATTGACCAG GGCGTGGCCG GAATTCTCAC CGACATGAAT
GAGCCCGCCG TGATCGGCAC GAATGGCCCG ACAACCACAT TCGACATGGA CATGGTTCAC
CACACGGAGA TGGGGCCTCG CACCCACGCC GAAATCCACA ACGTCTATGG GATGTTGGAG
ACGCTCGCTA CACGGGATGG CATGTTGCGG GCGCGGCCGA ACGAACGCCC GTTCATCATT
ACTCGTGCAA CGTTCGCCGG CGGCCAGCGC TATGCAGCCC AATGGAGCGG AGACAATTTC
GGCACCTGGG ACCATCTTCG TCTTAGCATG CCAATGCTCA ACGGCATGGG CCTTTCCGGA
TTGCAGTTCG TGGGCGCCGA CATCGGTGGA ATCATGCCGG TTCCGAGCCC CGAACTCTAC
ACGCGGTGGA TGCAGACCGG AGTACTGACT CCGTTTGTCT GGACACATTC GCTTGGTCCG
GGAAATCTTG AACCGTGGGG CTTCGGCAAT CGCATGGAGG CGATCAATCG CGAGTCGATC
AAGCTGCGCT ACCGGTTGAT GCCGTACATC TATACGACGT TCTGGGAAGC CGCGACCACC
GGTCAGCCAA TCATGCGCCC GCTGCTGCTC GAATATCCGG ATGATCCGTG GGCGATAGGA
ACCAATGACG AATATCTCTT CGGCAACGAT CTGCTCGTTG CCCCGATCGT GAAAGACTAC
GACGAATCCC GTGGGGTTTA CCTGCCGAAA GGGACTTGGT ACGACTACTG GACCGACCAC
AAGTATGTGG GTCCGCAGAT GATCACTGTA AATGCGCCGC TCGATCGCTT GCCACTCTTC
GTTCGCGGTG GAGCAATCCT TCCAAGCCAG CAGGACATGC AGCATACCGA TCAGTTCCCG
ATCGATCCGC TCACGCTCGA CATCTATCCA GATTCGTCTT CTTCGCGCCA GTATTACGAC
GACGACGGCA TCAGCTTTGG GTATCAGAAA GGCGCCTACT ACGTACAGAC GATCACCGCG
GAGGCAACGA CCGCAGGGGT GAACGTCACG TTGTCCGCTC CTGAGGGAAG CTTCCGTCCG
CCGAAGAGAT CTCTCGTGCT GCGAGTGCAT CTCCAGGCTG CACCGCCGAG CGGGGTGTCA
CTCGGAACCT CGAGGCTTTC GCAGCAGGAA TCCGTGAAGA AGTTGCAGGA AGTGCAGAAT
GGCTGGCTGT ATGATTCGGA CTCGCACACG GTTTGGATCA AGTTTCCTGA CCAGGAAGCC
GCCGCCAGCG TCGCGATTTC GCGCTGA
 
Protein sequence
MSYEYRPRTC RTFIFRVACF VLLALSATFA DWQSIGDLKP RGRQANEFTF VNARTTVAIT 
VLAPDIVRVR AVAGTSLPPD HSYAVVKTDW PAVKVDFSSK ENVESIRTEQ LEVRVQLAPF
RLAFYDAKGR LISKDADDQG MSWDGHRVRV WKDQPTDEHY FGFGEKSTPL DKRGRSLVMW
NKDPEGFDGT TEPLYQSVPF FVALRQGRAY GTFLDNTWRS SFDMGSEIPD VYSFGAENGE
LNYYFFAGPT PKQIVSRFTE LVGRVPMPPR WSLGYIQSRY SYYPETKVRF IAENFRERDI
PCDGIFLDID FMDGFRVFTW DKSRFPDPKR MMTDLRQQGF HIIAIVDPMV KVDPNYWVYK
QGLENNYFVK KPDGTVFTGK GWGGQSAYPD FASSKVRDWW AGLYKEQIDQ GVAGILTDMN
EPAVIGTNGP TTTFDMDMVH HTEMGPRTHA EIHNVYGMLE TLATRDGMLR ARPNERPFII
TRATFAGGQR YAAQWSGDNF GTWDHLRLSM PMLNGMGLSG LQFVGADIGG IMPVPSPELY
TRWMQTGVLT PFVWTHSLGP GNLEPWGFGN RMEAINRESI KLRYRLMPYI YTTFWEAATT
GQPIMRPLLL EYPDDPWAIG TNDEYLFGND LLVAPIVKDY DESRGVYLPK GTWYDYWTDH
KYVGPQMITV NAPLDRLPLF VRGGAILPSQ QDMQHTDQFP IDPLTLDIYP DSSSSRQYYD
DDGISFGYQK GAYYVQTITA EATTAGVNVT LSAPEGSFRP PKRSLVLRVH LQAAPPSGVS
LGTSRLSQQE SVKKLQEVQN GWLYDSDSHT VWIKFPDQEA AASVAISR