Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4522 |
Symbol | |
ID | 5901983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4893816 |
End bp | 4895522 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641565041 |
Product | glycosyl transferase family protein |
Protein accession | YP_001686140 |
Protein GI | 167648477 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | [TIGR01131] ATP synthase subunit 6 (eukaryotes),also subunit A (prokaryotes) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.185613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.129654 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCG AATCTCGCCT GGATGACTGG AGCCGCGGGT GGCGCGGCCC GCTGTTCGCC GCGCTGGTGG CCCTGATCGC CGGCCTGCCC GGGCTGTTCG CCATGCCGCC GCTCGACCGC GACGAGTCGC GCTTCGCCCA GGCCACCGCC CAGATGCTGG AAACCGACGA CATGGTCGTC ATCCGCTTCC AGGATCAGCC GCGCTTCAAG AAGCCGGTCG GCATCCACTG GCTGCAGGCG GCCAGCGTCA CCGCCTTCTC GGCCGCCGAG GATCGCGGCA TCTGGGCCTA TCGCATTCCC TCCCTGCTGG GCGCGATGCT GGCGGCGGCG GCCTGCGCCT GGGGTGCGGC GGCGCTGCTG GGACCGCGGA CGGGCCTGCT GGCCGGCGGC ATCCTGGGCG CGACCCTCCT GCTGTCGACC GAGGCCTTCA TCGCCAAGAC CGACGCGGCC CTGTGCGGCT TCACCACCCT GGCCATGGCC GCCCTGATGC GGATCTACGC CGCCCACCTG AACGGCGGAA CCATCACCCG CTGGACCAAG CTGGCCTTCT GGGTCGGCCT GGCCATGGGC GTGCTGATCA AGGGTCCGGT CGGGCTGATG GTCGTGGTGC TGAGCCTGCT GATGCTGGCG CTGTGGGACC GCAAGGCCCG CTGGCTGAAG GACCTGGGCT GGAGCTGGGG CCTGATCCTG CTGGCGGCGA TCGTCCTGCC CTGGGCCACG ATGATCACCG TGGCCACCGA CGGGGCCTTC TGGTCGACGG CGGTGGCCGG CGACCTGGCG CCGAAGCTGG CTGGCGGCCA GGAAAGCCAC GGCGCGCCGT TCGGCAGCTA CGCCCTGGCG GCCTTCCTGC TGGTGTTTCC CGCCACCCTG CTGTTGCCGG CCGGCCTGGC CCAGGGCTGG ACCCAGCGCA AGGACGCCGG GATCCGCTTC GCCCTCTGCT GGCTGATCCC CACCTGGCTG GTGTTCGAGA TCCTGCCGAC CAAGCTGGTC CACTACACCC TGCCCGCCGT GCCGGCCCTG GCCATGCTGA TGGCCGCCGC CCTGCGCCGC CCCCTGGGCG GGATCTCGCG GGCGATCGGC GCGGTGCTGT CGACCCTGGC CGGGGTGCTG CTGGCCGGTC TGGTCGGCTA TCTCTATTCG GCGCATGGCG ATCCGAGCGA CCTGCCCGTG ACGATCCTGA CCGCCCTGCT GTTCCTGGCC GCCGGCGTCG TCGGGACGAT CCTGATCCTG CGCAAGACCG CCGCCACGGC CCTGGTCGCG GCCGGGGTCC TGGGTATCCT GGCCCATGGC GCCCTGGTGG GCCTGTTCGT GCCGCGCCTG GAACCGCTGC TGCTGGCGCC GCGCCTGGAA AAGGCCCTCG AGCGGGCCGA CCTGGCGCCG CGCGGCGGCG CGCCCGGTCC CGTGGCCGTC ACCGGCTATG CCGAGCCCAG CATGATCTTC CTGCTGGGCA CCACCACCGA ACTGACCGAC CCGGCCGGCG CCGCCCAGGC CGTCGCCGAA GGCCGGCCGG CCGTGGTCGA GGGACGCCAG GAGAAGGCCT TCCAGGCCGC CATGGCCGCC CAGGGCCAGG CCGTTCGCCC CGCCGGCGTG GTCGAGGGCT TCGACTATTC CGATGGCGAC AAGGAACGGC TGACGCTCTA TCGCGGCGCG CCGATCCGGC CCGATGTCGA AGACGACAGC GCGGCGCAGC AGGAGACCCG CCCATGA
|
Protein sequence | MTLESRLDDW SRGWRGPLFA ALVALIAGLP GLFAMPPLDR DESRFAQATA QMLETDDMVV IRFQDQPRFK KPVGIHWLQA ASVTAFSAAE DRGIWAYRIP SLLGAMLAAA ACAWGAAALL GPRTGLLAGG ILGATLLLST EAFIAKTDAA LCGFTTLAMA ALMRIYAAHL NGGTITRWTK LAFWVGLAMG VLIKGPVGLM VVVLSLLMLA LWDRKARWLK DLGWSWGLIL LAAIVLPWAT MITVATDGAF WSTAVAGDLA PKLAGGQESH GAPFGSYALA AFLLVFPATL LLPAGLAQGW TQRKDAGIRF ALCWLIPTWL VFEILPTKLV HYTLPAVPAL AMLMAAALRR PLGGISRAIG AVLSTLAGVL LAGLVGYLYS AHGDPSDLPV TILTALLFLA AGVVGTILIL RKTAATALVA AGVLGILAHG ALVGLFVPRL EPLLLAPRLE KALERADLAP RGGAPGPVAV TGYAEPSMIF LLGTTTELTD PAGAAQAVAE GRPAVVEGRQ EKAFQAAMAA QGQAVRPAGV VEGFDYSDGD KERLTLYRGA PIRPDVEDDS AAQQETRP
|
| |