Gene Acid345_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2519 
Symbol 
ID4069888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2974203 
End bp2975951 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content56% 
IMG OID637984536 
Productalpha amylase 
Protein accessionYP_591594 
Protein GI94969546 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.252753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTACT CCGCGCGCAT CGCGCTCGCC TTCTTTTCCC TATTCCTCTT CCTCCCGAAA 
ATCAGTTTCG CCGTCGATCA AGCCCTCAAC GGCTACGAGC CCAAGTGGTG GAAAGAAGCA
GTTGTATACC AGGTCTATCC GCGCTCGTTC AAAGACTCCA ACGGCGATGG CATTGGCGAC
CTGAAGGGCA TCACCTCGAA GCTCGATTAC CTGCAATCGC TCGGCGTGGA CGTCATCTGG
CTGAGCCCGC ACTACGATTC CCCCAACGCC GACAACGGCT ACGACATCCG CGATTACGAG
AAAGTGATGA AGGAGTTCGG CACCATGGCC GACTTCGACG AACTTCTCAA AGGCGTGAAG
GCTCGCGGCA TGCGCCTGGT GCTCGATCTC GTGGTGAACC ACACTAGCGA CGAGCATCGC
TGGTTCGTCG AGAGCCGCAA GTCGAAGGAC AATCCGTATC GCGATTACTA CATCTGGCGC
CCCGGCAAAG ACGGTGGCCC GCCGAATAAT TACACCTCAT TCTTCTCCGG CTCCGCGTGG
ACGCTCGATC CCACGACCAA CGAGTACTAC CTGCACTGCT TCGCGGTGAA GCAGCCTGAC
TTGAACTGGG ACAACCCAAA AGTCCGCCAG GAAGTGTATT CCCTGATGAA GTTCTGGCTC
GACAAGGGCG TGGACGGATT CCGCATGGAC GTCATCCCCT TCATCTCGAA ACTGCCCGAT
CTGCCGGACA TCCCGCCCGA GTATCGCGAA CGTCCGCAGT ACTTCTACAC CCAGGGGCCG
CATCTGCATG AATATCTGCA GGAAATGAAT AAAGAGGTTC TCTCGAAGTA CGACATGATG
ACGGTCGGTG AGGCGTTCGG CGTCACGCTC GAGGGCACTC CGATGCTGGT GGATGAGCGC
CGTCACGAAC TCAACATGAT CTTCAACTTC GATGCGGTGC GAATTGGACA TCCCTCGACA
CCATGGATCG GCTGGACACT GCCAAAGCTG AAAGCGATCT ATACCGACGA AGACCAGAAG
CTGGATCAAC ACAGTTGGAA TACGGTCTTC CTGTCAAACC ACGACAATCC CCGCGTAGTC
TCTGCCTTTG GCGACGACTC TCCGGAGTGG CGCGAGAAAT CAGCGAAGCT GCTCGCGACG
ATGGTCCTCA CCCTCAAGGG CACTCCGTTC ATCTATCAGG GCGACGAACT CGGCATGACC
AATTATCCGT TCAAGGGCAT CGAGGACTTC GACGACATCG AAGTAAAGAA CGCGTGGAAG
GAATACGTGG AGACTGGACG CATCAGCAAA GAACACTTCC TCGACAACGC CCGGCGGGTG
GCACGCGACA ACTCGCGCAC TCCGATCCAG TGGGATGATT CGAGCAATGG TGGCTTCACC
ACCGGCAAGC CCTGGCTCGC GGTAAACCCG AATTACAAGA AAATCAATGC TGCAGAGGAG
CAGAAAGACA AAGACTCCGT CTACCAATAC TTCCAGCGCA TGCTGGCCTT CCGCAAGACG
ACCAAGGCTT TCAGCTACGG CGATTACAAG GACCTCGATC CGCAAAACGA AAAGATCTTC
GCCTACACGC GAACGCTCGG AAAAGAGAAG TATCTCGTCG TGCTTAATTT TTCGAAGGAT
GCGCTGAAGT ATTCCCTGCC CGGAGTGAAG GCGGGAAAAC TGGTGATGTC GAACGAAGGT
GCGGCGGAGG AGAACGCGAC CACGCTAATG ATGAAAGGCT GGGAAGCCCG GGTTTACAGA
GTCGAGTAA
 
Protein sequence
MTYSARIALA FFSLFLFLPK ISFAVDQALN GYEPKWWKEA VVYQVYPRSF KDSNGDGIGD 
LKGITSKLDY LQSLGVDVIW LSPHYDSPNA DNGYDIRDYE KVMKEFGTMA DFDELLKGVK
ARGMRLVLDL VVNHTSDEHR WFVESRKSKD NPYRDYYIWR PGKDGGPPNN YTSFFSGSAW
TLDPTTNEYY LHCFAVKQPD LNWDNPKVRQ EVYSLMKFWL DKGVDGFRMD VIPFISKLPD
LPDIPPEYRE RPQYFYTQGP HLHEYLQEMN KEVLSKYDMM TVGEAFGVTL EGTPMLVDER
RHELNMIFNF DAVRIGHPST PWIGWTLPKL KAIYTDEDQK LDQHSWNTVF LSNHDNPRVV
SAFGDDSPEW REKSAKLLAT MVLTLKGTPF IYQGDELGMT NYPFKGIEDF DDIEVKNAWK
EYVETGRISK EHFLDNARRV ARDNSRTPIQ WDDSSNGGFT TGKPWLAVNP NYKKINAAEE
QKDKDSVYQY FQRMLAFRKT TKAFSYGDYK DLDPQNEKIF AYTRTLGKEK YLVVLNFSKD
ALKYSLPGVK AGKLVMSNEG AAEENATTLM MKGWEARVYR VE