Gene Acid345_1502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1502 
Symbol 
ID4069249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1830984 
End bp1832816 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content60% 
IMG OID637983511 
Productalpha amylase 
Protein accessionYP_590578 
Protein GI94968530 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.855853 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTCGTC TCTCGCGTCT TGTCGCGTAT TTGCTTTTTT CCTGCGCTCT GTTTGCGCAG 
GCCCCCAAAA TTTCTAAAGT CGATCCACCC AACTGGTGGG CGAATTATCC GCACAGCCCG
ATGTTGCTGC TTACGGGCGA GAACCTTGCC AACGCGAAGG TCTCTGCCAA TTATCCGCAC
TTGAAAATCA CCAAGTCGGA AAGCAGCGCC GACGGGCGCT ACGTCTTCGT TTATCTCGAC
GAGCAGAAAG ATCTCAAGCC CGGCACCGCG CACTTCTCAG TGCAGACGGC GGGCGGAAAC
ACGGCTCTTG ATTTTGTTTT CGATAAGCGA CCGAGCCTCG AAGGCCGCGC CCAGGGGCTG
AACGCCAGTG ACACGATTTA CCTCATCATG CCCGACCGTT TTGCCGACGG CGATCCGTCG
AACAACGATC CGCAGAATGC GAAAGGGCAC TACGACCGCG CGAAGCAGAT GGCGTATCAC
GGTGGCGATC TCAAGGGCGT CACCGATCAC CTCGATTACC TGCACGATCT AGGCGTGTCG
ACTGTCTGGC TCACCCCCTG GTGGAAGAAC GACGGCAACT CCGCGGATTA TCACGGCTAT
CACGTCACGG ATTTCTATGG CATCGAAGAT CACTTCGGCA ACATGAAGGA CCTGCAGCAG
ATGGTTTCGG CCGCGCACGG CAAGGGCATG AAGGTCCTGA TGGATTACGT GGTGAACCAC
ACCGGCCCGT TCCATCCGTG GGCGGAGCAT CCGCCAACCC CAACGTGGCT GCACGGCACT
CCGGCGAAGC ATCCGCAGCC CAAGTACAAC TTCTGGCCGC TTGTGGATCC GCATGGCACG
CAAGCCGACC GAACGCCGGT CCTCGAAGGC TGGTTCGTGG ACCGTCTTCC TGACCTCAAC
GTGGACGACC CGAAGTTGAC GGAATACCTC ATCGACAACG GGCTCTGGTG GATGGAAACC
GCCAGCCTCG ACGGCTATCG CCTCGATACG TTCCCTTATT CCTCGCGCGA GTTCTGGAGC
AAGTGGCACA AGGCGCTGTT CGAGGTCTAT CCCAGGACGT TCACCATCGG CGAAGTGTCG
GATGGCGATC CCGCGGTGGT TTCTTTCTTC CAGGGCGGCC GCAAAGAATA CGACGGGATT
GATTCCGGCG TGACCACGGT CTTCGATTTC CCGACCATGT ACGCGATCCG CGACGTGCTG
ATCCGGCAGC AACCTGCTTC GAAGCTGCAA GAGGTCCTGG AGCACGACGC GCTGTATCCC
AACCCGGCGG TGCTGGTGCC GTTCATCGGC AACCACGACA AACCGCGCTT CATGGGCGAG
AAAGGCGCGA CCGTGCCGGA GTTGAATGCC GCCGCCAGCC TGCTGCTCAC GTTGCGCGGC
ATCCCGCAAC TCTACGCAGG CGATGAAATC GCCATGCCGG GCGGCGAGGA TCCCGATAAT
CGCCGCGACT TCCCAGGCGG TTTCGCGGGT GATCCACAAA ACGCTTTCAC TGCGTCGGGA
CGTACCCCAG AACAGCAGGA AGCCTTCGCG CATTTGCAAA AGCTGCTTCA GCTTCGCAAG
CAGCACAAAG CGCTGCAGAG CGGCGAACAA ACGGACCTCT TCTCGTCCGA GAAAGGGTTC
GCCTATTACC GCGTCAGTGG CGACGACCGT GTGCTTATCG TGCTGAACTC GGGCAGCGAC
GCGCAAACGA TCGCCATCCC GAAGGTGCAG ACGCCCCTCG CGAATGCAAC TTCCTTTACC
GCGCTCGACA GCGCTGCCAC GGCACAAACG AGTGGTGACA GCGTCACCGC GAACGTACCC
GGCATGACGG TCGCGATTTT CCAGGTCAAG TAG
 
Protein sequence
MCRLSRLVAY LLFSCALFAQ APKISKVDPP NWWANYPHSP MLLLTGENLA NAKVSANYPH 
LKITKSESSA DGRYVFVYLD EQKDLKPGTA HFSVQTAGGN TALDFVFDKR PSLEGRAQGL
NASDTIYLIM PDRFADGDPS NNDPQNAKGH YDRAKQMAYH GGDLKGVTDH LDYLHDLGVS
TVWLTPWWKN DGNSADYHGY HVTDFYGIED HFGNMKDLQQ MVSAAHGKGM KVLMDYVVNH
TGPFHPWAEH PPTPTWLHGT PAKHPQPKYN FWPLVDPHGT QADRTPVLEG WFVDRLPDLN
VDDPKLTEYL IDNGLWWMET ASLDGYRLDT FPYSSREFWS KWHKALFEVY PRTFTIGEVS
DGDPAVVSFF QGGRKEYDGI DSGVTTVFDF PTMYAIRDVL IRQQPASKLQ EVLEHDALYP
NPAVLVPFIG NHDKPRFMGE KGATVPELNA AASLLLTLRG IPQLYAGDEI AMPGGEDPDN
RRDFPGGFAG DPQNAFTASG RTPEQQEAFA HLQKLLQLRK QHKALQSGEQ TDLFSSEKGF
AYYRVSGDDR VLIVLNSGSD AQTIAIPKVQ TPLANATSFT ALDSAATAQT SGDSVTANVP
GMTVAIFQVK