Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4031 |
Symbol | |
ID | 4071170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4765275 |
End bp | 4766438 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986061 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_593105 |
Protein GI | 94971057 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.317277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAACA AGAAGCCCAT GAAAATCGGC ATCACGTGTT ATCCCACCTA CGGCGGCAGC GGCGTGGTGG CCACTGAACT CGGCATCGAA CTCGCGCAGC GCGGGCATCA GGTGCATTTC ATTTCCTATT CGCAGCCTAT CCGCCTGACT GAACCGCACC CCAACATCCA TTTTCACGAA GTCGAAGTCT CGCGCTATCC ACTCTTTGAG TACCCTCCGT ACGACCTCGC CCTCGCCACG CGCATGGCCG AGGTCGCCGA GATCTACAAC CTCGATCTGC TGCATGTTCA CTACGCCATT CCGCACTCAG TCAGCGCACT GCTCGCCCGC GAGATGACCG CATTCGGACC CGGCCGCAAA CGCCATCTGC CATTCGTCAC CACCCTTCAT GGCACGGACA TCACGCTCGT CGGCCTCGAT CCTTCGTATC TGCCGATCAC GCGTTTTTCC ATCGAGAAGA GCGATGGCGT CACCTCGATC TCGAACTACC TGCGCGAGAA GACGCTCCAG GCATTCGGCA TCAAAAACGA AATTCGCGTC ATTCCCAACT TCGTGAACTG CGATATCTAT CATCGCGACG GCAAAACGCA ACACTATCGC AAAGAGTGGG CCCCGAACGG CGAACGCGTG GTCGTGCACC TCTCGAACTT CCGTCCGGTA AAGCGCGTCC CTGATGTCAT TGAGATCTTC GAGCGTATCC AACAGAGAGT TCCTGCGAAG CTCGTCATGA TCGGCGACGG TCCAGATCGT TCGCGCGCCG AATGGATGGT CGTCGAAAAG AAGCTGCAGG ACCGCGTTCT CTTCCTCGGC AAACAAGACG ACGTCCACGA GAAACTGCCC GCGGCCGATC TCATGCTAAT GCCTAGCACG CTCGAGTCTT TCGGACTCGC CGCGCTCGAA GCCATGGCTT GCGAGGTGGT TCCTGTCGCG ACGAAAGCTG GAGGCGTTCC CGAAGTCATT GACCACGGCG TGGACGGCTA CCTCGCCGAT GTCGGCGACA TTGACACCAT GGCCATGTAC TCCATCGACA TCCTGAGCGA CGACGAAAAA CTCCACGAGA TGGCGAAAAT GGCGCGTTTC AAAGCACAAT CCACCTATTG CGCTTCGAAG ATTATTCCGA TGTACGAAGA TTTTTATCGT GAGGTGCTGG AGCGTGCTTC GTAG
|
Protein sequence | MTNKKPMKIG ITCYPTYGGS GVVATELGIE LAQRGHQVHF ISYSQPIRLT EPHPNIHFHE VEVSRYPLFE YPPYDLALAT RMAEVAEIYN LDLLHVHYAI PHSVSALLAR EMTAFGPGRK RHLPFVTTLH GTDITLVGLD PSYLPITRFS IEKSDGVTSI SNYLREKTLQ AFGIKNEIRV IPNFVNCDIY HRDGKTQHYR KEWAPNGERV VVHLSNFRPV KRVPDVIEIF ERIQQRVPAK LVMIGDGPDR SRAEWMVVEK KLQDRVLFLG KQDDVHEKLP AADLMLMPST LESFGLAALE AMACEVVPVA TKAGGVPEVI DHGVDGYLAD VGDIDTMAMY SIDILSDDEK LHEMAKMARF KAQSTYCASK IIPMYEDFYR EVLERAS
|
| |