Gene Information |
Locus tag | Caul_2129 |
Symbol | |
ID | 5899584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2296299 |
End bp | 2297522 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562618 |
Product | Alpha-galactosidase |
Protein accession | YP_001683755 |
Protein GI | 167646092 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
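The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the gene length is assumed to include the stop codon (both assumptions are consistent with the values in this record). A small GC helper is included for comparison with the reported GC content.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp as listed in this record.
length = gene_length(2296299, 2297522)
print(length, protein_length(length))  # prints "1224 407"
```

Passing the gene sequence from the Sequence section to `gc_percent` should give a value comparable to the GC content field, since the helper strips the spaces between the 10-base blocks.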
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.064626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.32848 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGATCGTG TGTCAAAGCC GGCGGCGCGG TCGCTTGGTA GGATCATCTC TACGCTCGCG GCTCTGTCGC TTTTGATGGT CGCGGGCCTA GCCCACGCGG ACGATCCGCC GCCGCCCTTG AAGGACAACG GCTTGGCCCG CACGCCGCCG ATGGGGTGGA ACAGCTGGAA CAGGTTCGCC TGCGATGTCG ACGAGACGCT GATCCGCAAG ACCGCCGACG CGATGGTCAG TTCGGGCATG CGCGACGCGG GCTATCAGTA CGTGGTCATC GACGATTGCT GGCATGGCGC GCGCGACGCG CATGGCGACA TCCAGCCTGA TCCCAAGCGC TTTCCCAGCG GCATGAAGGC GCTGGGCGAC TACATCCATT CCAGGGGGCT AAAATTCGGC ATCTATTCGG ACGCCGGTTT GAAGACCTGC GGCGGCCGGC CCGGCAGCTG GGGGCATGAA TATCAGGACG CCAAGCAATA CGCCGCCTGG GGCGTGGACT ACCTCAAATA CGACTGGTGC ATGGCTGGCA CGCAGGACGC CCGTTCGGCT TACTACATCA TGTCTTCGGC GCTGCAGGCG AGCGGCCGAG ACATCGTGCT GTCGATCTGC GAATGGGGGA CGTCCAAGCC GTGGCTGTGG GCCGACAAGG TCGGCAATCT CTGGCGGACC ACGGGCGACA TTTACGACAA GTGGGAGGGC GTACGCGACT ACAGCTCCGG CGTCATGAAC ATCATCGACA AGCAGGTCGA ACTCTATCCC TACGCCCGTC CAGGTCATTG GAACGATCCG GACATGCTCG AGGTCGGCAA CGGCGGCATG ACCACCGAGG AGTATCGTTC GCACTTCAGC CTGTGGGCCA TGCTGGCCGC GCCGCTGATC GCTGGTAACG ACATCGCCGC CATGGACGCG GAGACCAAGG CGATCCTTAC CAATAGGGAA GTGATCGCCA TCGATCAGGA TTCGCTCGGC CAGCAGGCGC GGCGGGTTTC CAAGACTGGA GACCTTGAGG TCTGGGTCAG GCCGTTGCAG GGCGGAGGCA GGGCGGTCGT CCTGCTCAAT CGCGGCCCGG CGCCGGCGCC GATCCGTCTG GACTGGAGCC AGTTGGATTA TCCGCCCACG CTAAAGGCCA GGGTTCGCGA CCTCTGGACG GGCAAGGATG TCGGCGTGCG CGAGGCGAGC TATCAGGCAA CCGTCGCCTC GCACGGCGTC GCCATGCTCA AAATCCAACC TTGA
|
Protein sequence | MDRVSKPAAR SLGRIISTLA ALSLLMVAGL AHADDPPPPL KDNGLARTPP MGWNSWNRFA CDVDETLIRK TADAMVSSGM RDAGYQYVVI DDCWHGARDA HGDIQPDPKR FPSGMKALGD YIHSRGLKFG IYSDAGLKTC GGRPGSWGHE YQDAKQYAAW GVDYLKYDWC MAGTQDARSA YYIMSSALQA SGRDIVLSIC EWGTSKPWLW ADKVGNLWRT TGDIYDKWEG VRDYSSGVMN IIDKQVELYP YARPGHWNDP DMLEVGNGGM TTEEYRSHFS LWAMLAAPLI AGNDIAAMDA ETKAILTNRE VIAIDQDSLG QQARRVSKTG DLEVWVRPLQ GGGRAVVLLN RGPAPAPIRL DWSQLDYPPT LKARVRDLWT GKDVGVREAS YQATVASHGV AMLKIQP
|
| |
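The gene sequence begins with TTG while the protein sequence begins with M. This matches translation table 11, in which TTG is one of the alternative initiation codons and the initiating codon is read as methionine. The following is a minimal sketch of that translation, not the database's own method: `translate_cds` and the constant names are hypothetical, `START_CODONS` lists only a subset of the table 11 initiators, and the input is assumed to be a complete, unspliced CDS on the forward strand.

```python
# Codon table in the conventional TCAG ordering; translation tables 1 and 11
# assign the same amino acids and differ only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Subset of table 11 initiation codons (assumption: enough for this record).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at a stop."""
    dna = "".join(sequence.split()).upper()  # drop spaces between blocks
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    residues = ["M"]  # the initiator is methionine regardless of the codon
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above, followed by its TGA stop codon.
print(translate_cds("TTGGATCGTG TGTCAAAGCC GGCGGCGCGG TGA"))  # prints "MDRVSKPAAR"
```

The printed residues match the first ten residues of the protein sequence listed above.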