Gene Acid345_3318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3318 
Symbol 
ID4070280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3932017 
End bp3932988 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID637985340 
Producttransglutaminase-like 
Protein accessionYP_592393 
Protein GI94970345 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.695082 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACT CCATCCGACA CCTGACCAAG TTCTCTTACG CGTCGCCGGT CAGCGAAAGC 
ATCATGGAGA CGCGGATGCG TCCGCGCAGC GACAGCAACC AGCGCTGCCT GCTGTTCCAT
TTGTCGGTGA GCCCGCGGTG CAGCGTGTTC TCGTTCCGGG ATCACATGGG GAACCACATC
CACCACTTCG ATATTCCGGG AGCGCACTCG CAGTTGGTGA TCGTGGCGGA GGCAGTGGTG
GAGCAGCAAG CGCCGGCGGC GCTGCCGGAT GCGCTGCCGT CGTCGGCTTG GGATGATTTG
GATTCGGAAG TGGAGCGCGG TGATTTCTGG GAGATGCTGC TGCCGAGCGA GTTTGCGAAA
CCGACGCCGC TGCTGCAGAA CCTCGCGGCG GAGTTGGAGG TTCGGCGCAA AGATGATCCG
CTGAGCGTTC TACGCGGCTT GAATGAGCAA CTCTATCGCT ATTTCGAATA CGTTCCGAAG
AGCACGCGGG TGGATTCGCC CATCGATGAC GCACTTGAAG CGCGATGCGG AGTTTGCCAG
GATTTCGCGC ACATCATGAT TTCGCTGGTA CGGCCGTTGG GGATTCCATG CCGCTACGTC
AGTGGCTATC TCAACAGCCG ATCCGAAGAT CACAACCGGT CGCCGGAGAC CGCAACGCAT
GCGTGGGTGG AGGCTTTATT GCCTGGTGTT GGATGGGTCG GGTTTGATCC GACGAACAAT
TTAATGGCCG GGGAACGGCA CATTCGGACG GCGATTGGGC GCGATTATTT CGACGTGCCT
CCGACCAAGG GGGTGTTCAG CGGCGACAGC CCAAGTGAAC TATCGGTGGC GGTACGGGTG
GCGGCTTCGA CGGCGCCTTC GGCACTGGAC GAGGATCAGC CTATTCCGGC AGATTGGGCG
ATTCTCGTCG AAAAGGCGCA GGAGCCACCA CGGCCAACCG CGGCGTCGCA AACCCAACAG
CAGCAGCAGT GA
 
Protein sequence
MYYSIRHLTK FSYASPVSES IMETRMRPRS DSNQRCLLFH LSVSPRCSVF SFRDHMGNHI 
HHFDIPGAHS QLVIVAEAVV EQQAPAALPD ALPSSAWDDL DSEVERGDFW EMLLPSEFAK
PTPLLQNLAA ELEVRRKDDP LSVLRGLNEQ LYRYFEYVPK STRVDSPIDD ALEARCGVCQ
DFAHIMISLV RPLGIPCRYV SGYLNSRSED HNRSPETATH AWVEALLPGV GWVGFDPTNN
LMAGERHIRT AIGRDYFDVP PTKGVFSGDS PSELSVAVRV AASTAPSALD EDQPIPADWA
ILVEKAQEPP RPTAASQTQQ QQQ