Gene Acid345_3523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3523 
Symbol 
ID4072782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4168825 
End bp4169952 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content58% 
IMG OID637985546 
Productglycoside hydrolase family protein 
Protein accessionYP_592598 
Protein GI94970550 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.416811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTTG CGAAACGAGT TTTTGGGTTG TTACTGATCG TCTTTGTATC GCTTCCCGCG 
TTCGCGGCAG GGCGCAAGCG GCTGCTCGCC TACTATCCCG ATTGGGCGAA AGACCAAACG
CCCGCGTACG ACGCGTCGAA AATTCCCTAC ACCAAACTCA CGCATATCCT GCATGCGTTT
CTGCTGCTCG ATCCATCGGG AAACGGCGCG CTGCAAATCG ATCCCGAACT GATCGAGCCT
GCGCTGACCC GCAACGCACA TAAGGCCGGC GTGAAAGTAA TGATCTCGAT CGGCGGCGCC
GATGCCGAAC AGGCCACCGC GTTCGCAACC ATCGCCGCCA GCGCGCATTT GCGCAATACA
TTTGCGAAGA ACCTGCATGC ATTCCTTGTC GCGCACGACT ACGACGGTGT GGACATTGAC
TGGGAGGTTC CCAACGCGCC CGACGACACA ACGCACTGCA CGCAGCTGAT GGAAGCCCTC
CGTCATGAAT TGCCCTCGCC ACGCTGGTTG ATCTCGATGG CGATTCCCTC GGATCCGCGC
GGATATGGGA CAGGCTTCGA TGTCCCTGCA CTGGCGCCGC TTCTCGACTT CATTAACGTG
ATGACCTACG ACTTCACCGG GCCGTGGATG GACGAAGCCG GACTGAACTC TCCGCTCTAC
CAGGACCCGA ACGACCCCGA GCAGGCGGGA AGCCTGAAGA CTTCGATGGA CCTGTTTCAC
GGGATCTACG GAGTGCCGCG TGCGAAGCTG AACATCGGCA CGGCGTTCTA CGGCTATGAG
TTCGATAACG TGCGGCGGCT TTGGAACTAC TGTCCGAGTG GCGATTGCAG TGCTGCCCCG
TCGTGGAACT ACGGCACGTA CATCAAGCAG CGGGTGAATG CGAAAGGCTG GAAGCGCTAT
TGGGACGGCT CAGCGCAGTC GCCGTACCTG TTGTATCAAG GCACGGGCGG CAAGGATGGG
CTGATTAGCT ACGACGATCC GATCTCGACC GCGTTGAAGA CGTACTACGT TCTGAAGACG
CGGGACTTTG GCGGAATGTT TATGTGGGAC CTCAGTGGCG ATTACGACGG GAAATCACAA
GACCTGCTCG ATGCGATGTA CGCGGTTTCA GCGCTTGTAA GTCACTAG
 
Protein sequence
MTLAKRVFGL LLIVFVSLPA FAAGRKRLLA YYPDWAKDQT PAYDASKIPY TKLTHILHAF 
LLLDPSGNGA LQIDPELIEP ALTRNAHKAG VKVMISIGGA DAEQATAFAT IAASAHLRNT
FAKNLHAFLV AHDYDGVDID WEVPNAPDDT THCTQLMEAL RHELPSPRWL ISMAIPSDPR
GYGTGFDVPA LAPLLDFINV MTYDFTGPWM DEAGLNSPLY QDPNDPEQAG SLKTSMDLFH
GIYGVPRAKL NIGTAFYGYE FDNVRRLWNY CPSGDCSAAP SWNYGTYIKQ RVNAKGWKRY
WDGSAQSPYL LYQGTGGKDG LISYDDPIST ALKTYYVLKT RDFGGMFMWD LSGDYDGKSQ
DLLDAMYAVS ALVSH