Gene Acid345_4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4079 
Symbol 
ID4072501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4829721 
End bp4831262 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content59% 
IMG OID637986110 
Productalpha amylase 
Protein accessionYP_593153 
Protein GI94971105 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.348017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAA CTGCTGCTCC CTCCAAGAAT CAGGGGAGGA GCGGGAGCGC TGAGCGCTCT 
CAGCCTTTGC CACTTCACAA ATATCCATCG CTTTACCAGG TCAACACTCG CGCCACGATG
CATGACCTCT CCGTAAAGCT CGGCCGCAGC GCTACGCTTG ACGACGTCTC TGACGAATAC
CTTTCCAGCC TCGCGCAACA AGGCTTCGAC TGGGTGTGGT TCCTCGGCGT GTGGCAAACC
GGCGCCGCTG CCCGCGCGGT CTCACGCTCG CATCAAGAGT GGATCGACGA GTATCGCCAA
ACCCTCGGCG ATTTCACCGA CGGCGACATT TGCGGATCGT GCTTCGCGAT CACCGCTTAC
CAGGCGCACA CCGACTTCGG AGGCAACGAC GCCCTTGCGC GCCTCTACAA GCGGGCTCAC
CAGCAAGGTC TGCGGCTATT GTTGGATTTC GTGCCCAACC ACACCGCGCT GGATCACTCC
TGGCTCGAAA CACATCCTGA CTACTATGTC GCCGGAACCG ATGAAAAGCT GCAGCACGAA
CCGCAGAACT ATGTGCGGCT CAACACGTCG CAAGGTCCGC GCATATTCGC CTTTGGCCGC
GATCCGTATT TCGCCGGATG GCCCGACACG CTGCAACTGA ATTACGCGAA CCCCGCTCTC
CAGGCCGCCA TGCGCCTGGA GCTTCTCAAC ATCTCGGAGT TCTGCGATGG CGTTCGCTGT
GACATGGCGA TGCTCATCCT TCCCGATGTC TTCGAGCGCA CGTGGGGGAT GCATCCAGAG
CCATTCTGGC GCAAGACGAT CGACGCCGTT CGCGCTTCAA AAGCCGGTTT CCTGTTCATG
GCGGAGGTCT ACTGGGACCT CGAATGGACG CTCCAGCAGG AAGGCTTCGA CTATACCTAC
GACAAACGCC TCTACGATCG TCTCCGCGAA GGCCACGCCA CTCCGGTACG CGACCATCTC
CGCGCCGACA TGGAGTTTCA ACGCAAGTCC GCGCGCTTCC TTGAGAACCA CGACGAACCG
CGCGTCGCCG CGACCTTTAC CCCGGAAATG CATCGCGCCG CCGCCATCAT CACCTATCTC
TGTCCCGGCC TGCGGTTTTT TCACGACGGA CAATTCGAAG GCCGCGTCAA GCGTCTCTCC
GTTCACCTCG GTCGCCGCCC GGAAGAGTCT ACGAACGAAT CGATCCGCGC TTTTTACCAA
CAGCTTCTAA AGTGCATCCA CGAGCCCGCC ATTCGCGATG GTCAGTGGCA GCTTTTGAAT
GCCACACCCG CTTGGGTCAG CAATTGGACC TTCGAATCGT TCGTCTGCTT CAGTTGGCAA
GCTGACGGCG AGCTGCCCTT CCTAGTGGTT GTGAACTATA GCGATCACCA AAGCCAGTGT
TACTTGCAAC TCCCGTTCGA CACGCTCCGC AGTCACTCGG TTTCGTTGCA GGATTTGATG
GGTTCCGCAA TCTACGAGCG CGTCGGCGAT GAACTGCTCT CTCGCGGACT CTACCTCGAC
ACGCCGGCGT GGGGCTTCCA CGTCTTCAAG ACCACAATCT AG
 
Protein sequence
MAATAAPSKN QGRSGSAERS QPLPLHKYPS LYQVNTRATM HDLSVKLGRS ATLDDVSDEY 
LSSLAQQGFD WVWFLGVWQT GAAARAVSRS HQEWIDEYRQ TLGDFTDGDI CGSCFAITAY
QAHTDFGGND ALARLYKRAH QQGLRLLLDF VPNHTALDHS WLETHPDYYV AGTDEKLQHE
PQNYVRLNTS QGPRIFAFGR DPYFAGWPDT LQLNYANPAL QAAMRLELLN ISEFCDGVRC
DMAMLILPDV FERTWGMHPE PFWRKTIDAV RASKAGFLFM AEVYWDLEWT LQQEGFDYTY
DKRLYDRLRE GHATPVRDHL RADMEFQRKS ARFLENHDEP RVAATFTPEM HRAAAIITYL
CPGLRFFHDG QFEGRVKRLS VHLGRRPEES TNESIRAFYQ QLLKCIHEPA IRDGQWQLLN
ATPAWVSNWT FESFVCFSWQ ADGELPFLVV VNYSDHQSQC YLQLPFDTLR SHSVSLQDLM
GSAIYERVGD ELLSRGLYLD TPAWGFHVFK TTI