Gene Acid345_3271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3271 
Symbol 
ID4072683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3873671 
End bp3875755 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content59% 
IMG OID637985292 
ProductBNR repeat-containing glycosyl hydrolase 
Protein accessionYP_592346 
Protein GI94970298 
COG category[R] General function prediction only 
COG ID[COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00240388 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTAG CCACCGCGTC CATTCCAAGG GGAAGGGCGG TATCATTAAC CCAAATGTTC 
ATGCGCAAGC TGTTTTCGCT GCTGTTTGTC CTGTTGTTGG CTGTGACCCT CGCATCGGCT
TCATCGTGGA AAGAACTTGG TCCAGATGGC GGCGACGCCC GCAGTCTCGC GTTCGACCCT
CACAATCCCG ACCGCATTCT GCTGGGTACA AGTTCCGGCC AGCTCTACAT GTCCAATGAT
CGCGGACACT CCTGGAATCG CTTCGCGCAG ATCGGCCTCG GCGACGACTA TGTCCTCGAC
AACACCGCGT TCGATCCCAG CGATCCGAAC ACGATTTACA TCGGTGTGTG GAGTGTGGAG
CACCAGCGTG ACGCCGGCGA TCTCTATGTC ACTCGCGACG GTGGCAAGTC ATGGAAGACC
ATCGAGGGCA TGCGCGGTAA ATCGATTCGC GCTCTTTCCA TCGCGCCGAG CAATTCGAAG
ACTCTGGTCA TCGGCGCACT CGATGGCGTC TATCGCAGCG ACGACAGTGG TGAAACCTGG
CGGCGTATCT CCCCGGAAAA TCACGCGGAG ATCAAGAACA TTGAATCCAT CGCCATCGAT
CCCAAAAATC CCGACGTCGT TTACGCTGGC ACCTGGCACC TGCCGTGGAA GACCGATGAC
GGAGGCAAGT CATGGCACCA CATCAAAGAG GGCGTCATTG ACGACTCCGA CGTCTTCTCC
ATCATCGTCG ACTTCTCGAA CCCTTCTACG GTATTCGCCA GCGCGTGCTC CGGTATCTAC
AAGAGCGAAA GCGCAGGCAA TCTCTTCCAT AAAGTTACTG GCATTCCTGC CACGGCGCGC
CGTACGCGCG TGCTGATGCA GGACCCGAAG AACCCGCAGA TTGTTTACGC CGGCACCACG
GAAGGGCTTT ACAAGACCCT CGATGGCGGT AAGACGTTCA AGCGCATGAC CGGCCCGGAA
GTCATCGTTA ACGATGTCTC CGTCGATCCG CGCGACACCA GCCGCGTGCT GCTGGCAACC
GATCGCAGCG GTGTTCTCGC CAGCGAAAAC GGCGGCGCCA CGTTCACGCA GTCAAACCGC
GGCTACTCGC ATCGCCAGGT CTCATCGCTG CTGGTGGATT CGAAAGATCC CAACACGATC
TATGTCGGAC TGCTGAACGA TCGCGACTTC GGTGGCATGT ATGTCTCGCG CGACGCCGGA
TCTACCTGGT CACAGGCGAG CAAGGGCCTG AAAGACCGCG ATGTCTTCAC TCTTCGCCAG
GCAGGTGATG GCGACATCTT CGCCGGCACC AATCACGGCG TCATGAAATT CTCGAGCAAG
ACGCTGTTGT GGGAGCCTGC CAGTGTCGTG GTGAAGGAAA AGACCACGCC TGGTCCTAAG
ATTCCCGCGA AGGTCGTTAA AGGCAAAAAG ATTCCGGCAC ACGAAGGCAC GCCGAAGGTC
ACCATCGAGA AATCGGAACT GACCTCGCAG GTGTCGCAAC TCGTGTTCAC TCCGGCGCTC
TGGTACGCCG CCGCAAGCAG TGGCGTTTAC AGCAGCAAGG ATGATGGCAA AACCTGGCAA
CACGCCGATA TCGAAGGTGA CGTACGTTTC CTTGCGATCG GCGCGTTCGG CGACAAGGCC
TTCGCAGCAT CCGCACTTGA CGGCTACGTC ACAACCGATC ACGGCGGACA CTGGACGCAA
GTGAGCGTGC CGAAGTTCAT CACCGGTATC TACGATGCTG CGGTTGGGTT CGATCAGTCG
CTTTGGCTGG CGACGCAGCA GGGTGCATTG CGTAGCGGTG ACGACGGAAA GACATGGGAG
CACGTTACGG CGGGACTCCC GTGGAAGCAC GTCCTCACCG TCAGCCTCGA TACCGCAAAT
AACCGGATGC TCGCCACCTC TCGCGATGGT CGCGGCGTTT ACTCCAGCTC CGACAACGGA
CAGACTTGGA AGTATTCAGA TGACGCTGGT CTGCTGGTTC GTAGCGCCGT CGGTTATCGT
GGCGGCTACC TGGCAGCGAC GGCGTACAAC GGCGTGGCGA TTTCCTCGGC ACCGGGCCAC
AGTGCGACGG CGCCTTCAGC GAGCGGTTCC GGTAACTCGA ATTAA
 
Protein sequence
MRLATASIPR GRAVSLTQMF MRKLFSLLFV LLLAVTLASA SSWKELGPDG GDARSLAFDP 
HNPDRILLGT SSGQLYMSND RGHSWNRFAQ IGLGDDYVLD NTAFDPSDPN TIYIGVWSVE
HQRDAGDLYV TRDGGKSWKT IEGMRGKSIR ALSIAPSNSK TLVIGALDGV YRSDDSGETW
RRISPENHAE IKNIESIAID PKNPDVVYAG TWHLPWKTDD GGKSWHHIKE GVIDDSDVFS
IIVDFSNPST VFASACSGIY KSESAGNLFH KVTGIPATAR RTRVLMQDPK NPQIVYAGTT
EGLYKTLDGG KTFKRMTGPE VIVNDVSVDP RDTSRVLLAT DRSGVLASEN GGATFTQSNR
GYSHRQVSSL LVDSKDPNTI YVGLLNDRDF GGMYVSRDAG STWSQASKGL KDRDVFTLRQ
AGDGDIFAGT NHGVMKFSSK TLLWEPASVV VKEKTTPGPK IPAKVVKGKK IPAHEGTPKV
TIEKSELTSQ VSQLVFTPAL WYAAASSGVY SSKDDGKTWQ HADIEGDVRF LAIGAFGDKA
FAASALDGYV TTDHGGHWTQ VSVPKFITGI YDAAVGFDQS LWLATQQGAL RSGDDGKTWE
HVTAGLPWKH VLTVSLDTAN NRMLATSRDG RGVYSSSDNG QTWKYSDDAG LLVRSAVGYR
GGYLAATAYN GVAISSAPGH SATAPSASGS GNSN