Gene Acid345_2146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2146 
Symbol 
ID4068782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2564039 
End bp2566273 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content61% 
IMG OID637984161 
Producttransglutaminase-like 
Protein accessionYP_591221 
Protein GI94969173 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.360106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCA CAGCGACCCA ATCCGGACTG AACCGCGCCC TGCACGAAAC CGCAGCGCGC 
GTTCCCGAAC CGATTGAGCA CTATTTCCAG GTGTCGTTGT TCCTGCTCTT GATCACCGGT
TTCGTAACCT TGGCAGGCAC CGGAAAACTG GACTTGCTGT CGGTTGTCTT CGTGCTCGCC
GCGTTAGGTC TCCGCGCAAT CCATCTCGCG CAGAACAAGC AAATCATTAT TCCGGAACGC
TGGACCACTG CGCTCACGAT TGCCTACGTC GCCGTCTACG CCGCCGATTA TTTCTTTCTC
TCACGCGACT TCGTCACCGC CACGGTTCAC CTCGTTCTAT TCGGAATGGT TGTTAAGGTC
TTCTCGGTTC ACCGCGAGCG CGACCTGCTG TACCTCGGTG TGCTGTCGTT CCTCATGATT
CTCGCCGCCT CGGTCCTCAC GGTGAACACT GCGTTCCTCG GTGGCTTCGC ACTCTTCCTG
CTCGTCGCGA TCGCAACCTT CGTCAGCTTC GAAATGCGCC GCTCCGCGCT GGCAGCCGAC
TCCGTGCAAT CGTTGAACGC GATCCCATCG CGTTCGCGAC CGCACGCGAC GAAGGTCAAC
ACTTCCCTGT CGCGCACCGC GCTGATGTTG GCGACCACAA TCCTCCTCGG CGCGACGGTC
CTGTTCTTCA CCCTTCCGCG CATCTCGGGC GGCTATCTCG GCTCTTACAC GCGTGGCTCC
GATCCGGTCA GCGGCTTCCG CGACAACATC CTGCTCGGAC AAATCGGCCG CATCCAGCAA
TCGTCGCAAG TCGTCATGCA CCTGCAGATC ACCGGGGACC ATCCGGCCTT CGACGGCAAA
GTTCGCGGCT CGGTCCTCTC CCGCTTCGAT GGCCGCTCCT GGGCCGACAC ACCGCGCTAC
ATGAACGTCA TCAATTCGCG CTTCGGCCGC TATGACATCT CCAACGAGAC GCTGGCTGCC
GATCCGTATC TCGAGCGCGT CTCGGCCACG CACAAGAACC AGAACATGCC TTATCGCGTC
TCGATGGAGC CGACGATGAG TTCGGTCCTG TTCCTGGTGA AGGGCACGAT TGAACTCCAG
GGCAGCTTCC GTCAGATCGC GTTCGATAGC GCACAGTCCT ACATCAACCT CGACGGCGGC
CATCCCTCGA ACGACTACTG GGGAGTCGCG AACGTCGCGC CTCCCGATCC CGCCCTGCTG
CGGCAAGCGG GCAGCGACTA TCCCGCCCGC GTCGCGCAGC GCTATTTGCA ACTGCCGCCA
CTTGATCCGC GGATTCCCGA TCTCGCGCGC AAAGTAACGG CGAAGGCATC GAACCCCTAC
GACAAAGCGC ATGCTATGGA GACCTATCTC CAGAGCAGCT ACGGCTACAC CCTCGAGTTT
CCTCTCGTAC CTCCAGCCGA TCCACTTGCG AATTTCCTCT TCGAGCGCAA GCAGGGCCAC
TGCGAATACT TCGCCAGCGC CATGGCGGTC ATGCTGCGTT CCGTGGGCAT TCCTACGCGC
GTCGCTACCG GCTTCCGTGG CGGCGAGTAC AACGACATCA CGGGCAGCTA CATCATTCGT
GCCCGCGACG CTCACGCCTG GGTCGAAGTC TATTTCCCAA ACCAAGGGTG GGTGACATTC
GATCCAACCG CTGCGGCGCC AATGGAGCCG GCCGGTCTAT TCGGGCGCTT ACGCCTCTAC
GCCGATGCCA TGAACGAGTT CTGGCGTGAG TGGATCATCA ACTACGACTT CCAGCACCAG
CGCACCCTCA CGGCCGCGGT GACCACCGAG TCCCTGCAAA AAGGCATGAG CCTGCGCGAT
TGGATCTCCG CGAAATACGA CCGCATGCTC GGCCGCGCGC GCCAGGTACA GAAATCATTC
TCAGAATCGC CGCAGCGCCA ATCGCGACTA GCCGTGACCG TAATTTGCCT CATGCTGCTC
ATCGTCATCG CGCCGCGCGC CTGGCATCTT CTCAAGATGC GCCGGATCGC CGCGCATCCC
GGCGACGCGC CCGAAGCCGC TGCTTCCATC TGGTATGGCC GCGCTACGCA TCACCTCGCA
CGCTACGGTT GGGCCAAGCA ACCCTCGCAA ACGCCCGCGG AATACGCGCA GAGCATCGAC
CACGAAACCA TGCGCCGGAC GATGGAAGAG TTCACACTGC TTTATGAACA GGCACGATTC
GGAGGTTCTG CGACCAGCGC GAGCCGTTTG CCGGAGCTGT TTCAACGCCT GAAGGTTCGT
CAGCGGGAAC GCTAA
 
Protein sequence
MASTATQSGL NRALHETAAR VPEPIEHYFQ VSLFLLLITG FVTLAGTGKL DLLSVVFVLA 
ALGLRAIHLA QNKQIIIPER WTTALTIAYV AVYAADYFFL SRDFVTATVH LVLFGMVVKV
FSVHRERDLL YLGVLSFLMI LAASVLTVNT AFLGGFALFL LVAIATFVSF EMRRSALAAD
SVQSLNAIPS RSRPHATKVN TSLSRTALML ATTILLGATV LFFTLPRISG GYLGSYTRGS
DPVSGFRDNI LLGQIGRIQQ SSQVVMHLQI TGDHPAFDGK VRGSVLSRFD GRSWADTPRY
MNVINSRFGR YDISNETLAA DPYLERVSAT HKNQNMPYRV SMEPTMSSVL FLVKGTIELQ
GSFRQIAFDS AQSYINLDGG HPSNDYWGVA NVAPPDPALL RQAGSDYPAR VAQRYLQLPP
LDPRIPDLAR KVTAKASNPY DKAHAMETYL QSSYGYTLEF PLVPPADPLA NFLFERKQGH
CEYFASAMAV MLRSVGIPTR VATGFRGGEY NDITGSYIIR ARDAHAWVEV YFPNQGWVTF
DPTAAAPMEP AGLFGRLRLY ADAMNEFWRE WIINYDFQHQ RTLTAAVTTE SLQKGMSLRD
WISAKYDRML GRARQVQKSF SESPQRQSRL AVTVICLMLL IVIAPRAWHL LKMRRIAAHP
GDAPEAAASI WYGRATHHLA RYGWAKQPSQ TPAEYAQSID HETMRRTMEE FTLLYEQARF
GGSATSASRL PELFQRLKVR QRER