Gene Acid345_0944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0944 
Symbol 
ID4070826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1202075 
End bp1204549 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content60% 
IMG OID637982951 
Productglycoside hydrolase/PKD 
Protein accessionYP_590021 
Protein GI94967973 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.321699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCTT CCTATTTGCG ACAGTGTCAG ATCGCAGTGC TGGTGGTGGC GGGAATGGCG 
TGTGCCGTTT CGGCATCCGC ACAGTCGCTT GCTAACAAAG CCGTATCCAT CTCGGTGAAT
GCCGGCGACG GCTCGTATCA ATTGGCGGGC GTCGATGGGA AACCGGTGCT GTCGGCGCGG
GTAGGAGCGG AGGTCGATCA CAAGTGGCTG CGGTCGTCGG AGTATGGCGG TTGCAAGGCT
GCTGAGTCGA AGTTCAACGA TGACCTTGGC GCAGGCAAGC AGATCGCGGT TACATGCGCG
GGAACAGCGA GCAAGCCGGA ACTCACATAC GTGTTGCAAG CCTACGATCA GGCCCCGTAT
GGAACGGTGC AGGTGAAGCT TCGCAACACC ACGGGGAAGA AGCTTTCGGT GCAGGCAATT
CGCAGCGTCG AAGCAATCGG AGAATCGCGC ATCGAATTGG GCGCGAACGC TGCGGCGGAT
CGCGTGCTCT CCGACAGCTT CAGCGAAGAC TGGCCGGACC TGAAGATCTA CGATCTCGCG
CAAGCGCCGG AGGGCCTGCA TCGTGGCGTG GGCAGCCAGC TTATTTATAA CCGCGAGAGC
AAGCAGGGCC TGTTCCTCGG GGCACTGACC TCCGAAAAAT TTCTTACCAT CCTGCGCTTG
AAGACCGACG GCGCAAAGAT CGGGTCGTAC GAAGTGGATG CCACAGGAAC GACAGAAATC
CAGCGGGCGC TCCACTTGGA GGACTCACCG GCGGAGGACG TCGTCGAGCT GAGCCTGCCG
CTGGCTGCGG GAGAGACGAT GACCTCCGAT CGCCTGATGC TTGCGATTGG CGGTGACTAT
CACGCGCAGC TGCTGGCGTA CGGCGATGCC ATACGTCGCC TGCATCATGC GCGGGTAAAC
GGCGAGACTC CCGTGGGCTG GTGGAGTTGG ACTGCCTACT ATGGCGCCAT CAACCAGGGA
GAAGTGCTTG CCAATGCCGA CTGGCTCTCG CAGAACCTCG CGTCGCTCGG GTACACCTTC
TTCCAGGTGG ATGAAGGCTA CCAATACGCG CGCGGCGAAT TCACGACCAC GAACGCCACG
CAATTTCCCG ATGGCATGCG CGTGGTGGGA CATCACATCG TTGGCGATGG ACTGGTCTTC
GGCTTGTGGA CAGCGCCCTT CGAGATCACT ACACGCTCGT GGGTCTTCCA GAACCACAAA
GATTGGCTGG TGAAGAATGC GAAGGGCCAG CCAATCCCGA TCGGCGACGT GTGGGGCCAG
CACGTGGACA CGCTTTACGC GATTGATACG ACCAACCCCG GAGCGCAGGA ATATCTGCGC
CAGACGTATA AGACGATTGT GCGCGAGTGG GGCGTTCGTT TCATCAAGCT CGACTTCATG
GACACCACCG CCATCGAAGG CTACTACTAC AAACCGAACA CCACCGCCCT TGAGGCGCAG
CGCATCGGCC TGCAGATCAT TCGCGACACT GTGGGTGACG AGGTCATCCT CGACAAAGAC
GGTAGCCCGA TGTTGAACCC TGTCGGTCTG GTAGATTCCG GCCGCGTCTC CGCCGATACT
GGTCACAACT TCGAGCGCAC GAAAGCCGCG GAGCCTGGCA TTGCCGCGCG CTTCTACATG
CATCGGAACT TTTTCATCAA CGATCCCGAT GCGTACAACG TGACCGACAG CTATCTCATG
GAGGAGCACG AGCAGAAGCC GCCCGTCACG CTCGCGGGAG CGCAGGCGTC GATTGCACTT
TCGGCGATTT CCGGCGGGAA CTACGAGATT GGCGATGACA TGCTGTTGCT CGGTCGCGAG
AAAGATCGCC TGGCGTTGGC GTCGAATATC GATCTGATCA ACATGATCCG GATCGGCCGC
GCTGCGACTC CTGTGGACTT GCTGACTTAC GCATCCGAAG ACGAACAGCC CAGCGTGTTC
TTCCTCCGCG AAGATCAACG CCAGGCCGTG CTCGTAGTTT TCAACTGGAC GAAGTCATCG
CGGACACACC AGTTCCAACT TGCGGATCTA GGGCTTCCGG CATCAGGAAA ATTTGAAGCG
ACGGATGTGC TGAACGGAAA TGCTGCCGTC GCGCTCGGAA ACGGTGCCTT GCGAATTGCC
GACCAGTCGG GCGAATCGGT GAGAGTGATC AAGATCGTGG ATTCGAGCGT TCAGCCCTCG
GCGCCAGTGC TGAAGACCAG CGTGCCGGAG ACGGCAAAGG CCGGCGAGAC GATCCATTGC
AGCGTGCAAG CCGATCTCGA ACATACTCCG GCGACCACCT ATCGCTGGGA TTTCGGCGAT
GGCACCCACG CTTCGGGGAA AACCGCTACG CACGCGTATC CAACGGCGGG AGAGTACAGC
ATCCAACTTG TGGCGGAGGG ACTCGACGGC GTACCGGCGA ACCAGACGTT CACGGTCAAG
GTCACTGGCA ATCTGCCCGT GATGCCGCAA CTGAAAGACA ACCGTCGCTT CGTGGAGCCC
ACCGAACGAA AGTAG
 
Protein sequence
MNSSYLRQCQ IAVLVVAGMA CAVSASAQSL ANKAVSISVN AGDGSYQLAG VDGKPVLSAR 
VGAEVDHKWL RSSEYGGCKA AESKFNDDLG AGKQIAVTCA GTASKPELTY VLQAYDQAPY
GTVQVKLRNT TGKKLSVQAI RSVEAIGESR IELGANAAAD RVLSDSFSED WPDLKIYDLA
QAPEGLHRGV GSQLIYNRES KQGLFLGALT SEKFLTILRL KTDGAKIGSY EVDATGTTEI
QRALHLEDSP AEDVVELSLP LAAGETMTSD RLMLAIGGDY HAQLLAYGDA IRRLHHARVN
GETPVGWWSW TAYYGAINQG EVLANADWLS QNLASLGYTF FQVDEGYQYA RGEFTTTNAT
QFPDGMRVVG HHIVGDGLVF GLWTAPFEIT TRSWVFQNHK DWLVKNAKGQ PIPIGDVWGQ
HVDTLYAIDT TNPGAQEYLR QTYKTIVREW GVRFIKLDFM DTTAIEGYYY KPNTTALEAQ
RIGLQIIRDT VGDEVILDKD GSPMLNPVGL VDSGRVSADT GHNFERTKAA EPGIAARFYM
HRNFFINDPD AYNVTDSYLM EEHEQKPPVT LAGAQASIAL SAISGGNYEI GDDMLLLGRE
KDRLALASNI DLINMIRIGR AATPVDLLTY ASEDEQPSVF FLREDQRQAV LVVFNWTKSS
RTHQFQLADL GLPASGKFEA TDVLNGNAAV ALGNGALRIA DQSGESVRVI KIVDSSVQPS
APVLKTSVPE TAKAGETIHC SVQADLEHTP ATTYRWDFGD GTHASGKTAT HAYPTAGEYS
IQLVAEGLDG VPANQTFTVK VTGNLPVMPQ LKDNRRFVEP TERK