Gene Information |
Locus tag | Acid345_4609 |
Symbol | |
ID | 4070766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5462041 |
End bp | 5463291 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986649 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_593683 |
Protein GI | 94971635 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
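
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5462041
end_bp = 5463291

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the terminal stop codon is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1251 416
```

Both values match the Gene Length (1251 bp) and Protein Length (416 aa) fields.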
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.955462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.431404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTT GCCTGGTCAC AGCCTTCCCG CCTAGCCGCC GAGGGCTGAA CGAATACGGG TACCACATCG CACGTGAGCT CCAGCGCGAT CCCGTGCTCA GCGTCACTGT GCTCGCCGAC GAGTTGGAGA CGCCTGAGCC GGAGTTGGCA GAATTTGACG TCAACCGCGT TTGGCGCTTC GACAGCCTCT CCAACCCCTC ACGCCTCGCC AAAGCCATCC GCTCCTGCAA GCCGGATGTG GTTTGGTTCA ACCTGTTGTT CTCAACCTTC GGGAACAACC CGCTCGCGGC GTTCTCCGGT CTCACCATCC CCGCCACTAC CCGCATGGGC GGCTGCTACA CCCACGTCAC CCTCCATCAC TTGATGGAGA ACATTGATCT CTCGCATGCC AACGTTCGTT TCCCGCGCGC CTATCGCTTT GCCGGAAATG TCGCCACCCG CATGCTGCTC GCCGCCAACT CGATTAGCGT CCTGTTGCCG GCCTACCGTC GTACGCTCAT CAACAAATAC AAGGGCGAAA ACGTTCACTT CCGCGCCCAC GGCATCATGT CGGCCCGGCC TGAACCGCCC GATTACTCGC GCCGTGGCGT CCCTGACCAT CGCGTCCTCG CCTTCGGCAA GTGGGGCACA TACAAGCGCC TCGAGCTCTT GATGGACTCT TTTGAGCTAG TCGTGAAGCG CCTGCCGAAT GCCAAACTTA TCGTCGCCGG CAGCGATCAC CCTATGACCC CCGGCTACCT CGATAGCATT GCCGAAAAAT ATAAGGACGA CCCGCGCATC GAATTTGTCG GATACGTCGC GGAAGAAGAC ATCCCCGAAC TCTTCCGTAG CTCCAGCGTT CTAGTCATGC CTTACTCCTC CGCCACAGGT TCTTCCGGAG TCGCACATCT CGCAGCCGAG TTTGGGCTTC CCATTATCTG CGCCGATATT CCCGACTTCC ACGAGATGGC CGATGACGAA GGATTGGGCA TCCTCTTTTA TCAAACCGGT AGCGAAAGGA GCCTCGCCGA CCAGATCTGC GGTCTGCTTA ATTCGCCCGA AATGATGAAA GAGATGTCGG AACAAAATTT TTCCGCCGCG CTACGACAGA CCATGCCGCA GATCATCCGG CAATATCTGC GCTCGTTTGA CTTGCACCAG CGCCAGCGCG CGTTGCAGCC CATCGCTCGC TTTCGCCGCA TCCCCGGTTG GGTGCCCTCG CGTTCGGCCA TCTTTCGCGC TGCCGCGCCA AGGTGGGTGC CATGGATGTA A
|
Protein sequence | MKICLVTAFP PSRRGLNEYG YHIARELQRD PVLSVTVLAD ELETPEPELA EFDVNRVWRF DSLSNPSRLA KAIRSCKPDV VWFNLLFSTF GNNPLAAFSG LTIPATTRMG GCYTHVTLHH LMENIDLSHA NVRFPRAYRF AGNVATRMLL AANSISVLLP AYRRTLINKY KGENVHFRAH GIMSARPEPP DYSRRGVPDH RVLAFGKWGT YKRLELLMDS FELVVKRLPN AKLIVAGSDH PMTPGYLDSI AEKYKDDPRI EFVGYVAEED IPELFRSSSV LVMPYSSATG SSGVAHLAAE FGLPIICADI PDFHEMADDE GLGILFYQTG SERSLADQIC GLLNSPEMMK EMSEQNFSAA LRQTMPQIIR QYLRSFDLHQ RQRALQPIAR FRRIPGWVPS RSAIFRAAAP RWVPWM