Gene Information |
Locus tag | Caul_0313 |
Symbol | |
ID | 5897587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 352321 |
End bp | 354231 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641560797 |
Product | Alpha-galactosidase |
Protein accession | YP_001681948 |
Protein GI | 167644285 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
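The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 352321
end_bp = 354231
gene_length_bp = 1911
protein_length_aa = 636

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for the values listed above.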
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC GCTTCCTGCT GGCCTCGGCC GCCGCCGCCG GCCTTCTGGC CCTGTCCATG CCCGCCGTCG CCCAGAACGA CGCTCTGGCG GCCACGGGCA AGTGGTCGAT CCCCGAGCGG TCCCAGGCGC GCACGCCGCC CATGGGCTGG AATTCGTGGA ACGCCTTCCG CACCGAGGTC GACGAGGCCA AGGTGGTGGG CGCGGCCAAG GTGCTGGTCG ATAGCGGCCT GTCCAAGCTG GGCTACACCT ATGTCAACAT CGACGATGGC TGGTGGCTCA AGCGTCGCCA GTCGGACGGG CGTCTGGAGA TCCGCACGGC GATCTTCCCG TCGGCCAAGG TGACGGGGAA AGACACCAGC TTCCGTCCCT ATACCGACGC CTTGCACAAG ATGGGCCTGA AGGCGGGCAT CTATACCGAC ATCGGCCGCA ACGCCTGCTC GCAGGCCTAT GACCTGCATT CCCCCAACCT GCCCGAAGGC ACGACGGCCG AACGCGAAAT CGGACTTCAG GGCCACGTCG ATCAGGACAT CGCGCTCTAT TTCAAGGACT GGGGTTTCGA CTACATCAAG GTCGACGCGT GCGGCATCAA TGTCTACGGC CCAGAGACCG ACATCGTGCG CGAGCATGGC TACAGGGCCG TGCCGCCCCT GATCGACCAG GTGTCGATCA ACCGCACCGA CGTGCCCGCC GTCCGGGCCC GGTACGCCGA GGTGGCCCAG GCGCTGAAGA CCTACAATCC GGACGGCGAC TACATCCTGG CGATCTGCAA CTGGGGCTCG GCCGACGTCA GGTCCTGGGG CAAGGACGTG GGTCACCTGT GGCGCACCAG CGGCGACATC ACCCCGACCT GGACGCGCAT GCTGCACAAT TTCGACAGCG CCTCGACCCG CGCGCTCTAC GCCAAGCCTG GCGCGTGGAA CGATCCGGAC ATCCTGTTCA TCGGCCACGG GGAGTTTGAT CAGAACCACC TGACCGAGGC GCGTTCGCAC TTTTCGCTTT GGGCCATGAT CAACGCGCCG CTGCTGATCA GCTACGACCT GCGCCAGGCG CCGCGAAGCT TGCTGGACAT CTGGGGCGCC GCCGACATCG TGCGGCTCAA CCAGGATCCG GGCGGCCACC AGGGCGTCAT CGCCTACGCC TCCGACGACG TGCAGATCAT CGTCAAGACC CTGGCCAGCG GCAAGAAGGC CGTGGCCCTG TTCAACCGGG GCCTGGGCAA GACCGACGTG ACCCTCACGG CCGCGCAGCT GAAATTCGCC GGCGACGCGC CGATCCAACT GAAGAACCTG TGGGACAAGA CCGCGCCGGC CTCGTTCACC GGTGAGACAA GCTTCCCGCT GGAATCGCGC CAGACCCTGG TCTTCGAGGC CTCTGGCTCG CGAGCGCTCG GCGACGGCGT CTATCTGTCG GAAATTCCGG GCGACGTGAA CGTCGCTGTC GATGGCGTGA TCACGCCCGA GCCGGATCCG GTCGTTCACC GCATGCGCAA CGCCTGGGGC GAGACCCGTG GCTCGGGCGA GCGCCCGACC TATGCCGGCT GGGGCGGCGC CCAGGCCGAC GCCACGCCCT ACGACCAGGC GCTACGCATC GGCGGCCAAG GCTTCGACAC CGGCATCGGG GTGCTGGCCA ACTCCCGCAT CGAGGTGCGC AACGCGGGCC ATGCGCGCTT CGAAGCGCGG GTCGGCGTGG ACGATTCCAC ACGCAACACC AAGGACAAGG TGCGCTTCTC CGTCTACGGC GACGGCCAGC TCCTGGCCCA AAGCCCGTCC ATGAGCCTCG GCGAGGCTCC GCGCTCGCTC 
ACCGCCGACA TCAAGGGCGT TCGCATCGTC GAGATCGTCG CCCGATCGGA AACCGCGACC AGCGACCTGC CGCTGGTCGT CACCTGGGGA GACGCGGCCC TGCGCCGCTG A
|
Protein sequence | MTVRFLLASA AAAGLLALSM PAVAQNDALA ATGKWSIPER SQARTPPMGW NSWNAFRTEV DEAKVVGAAK VLVDSGLSKL GYTYVNIDDG WWLKRRQSDG RLEIRTAIFP SAKVTGKDTS FRPYTDALHK MGLKAGIYTD IGRNACSQAY DLHSPNLPEG TTAEREIGLQ GHVDQDIALY FKDWGFDYIK VDACGINVYG PETDIVREHG YRAVPPLIDQ VSINRTDVPA VRARYAEVAQ ALKTYNPDGD YILAICNWGS ADVRSWGKDV GHLWRTSGDI TPTWTRMLHN FDSASTRALY AKPGAWNDPD ILFIGHGEFD QNHLTEARSH FSLWAMINAP LLISYDLRQA PRSLLDIWGA ADIVRLNQDP GGHQGVIAYA SDDVQIIVKT LASGKKAVAL FNRGLGKTDV TLTAAQLKFA GDAPIQLKNL WDKTAPASFT GETSFPLESR QTLVFEASGS RALGDGVYLS EIPGDVNVAV DGVITPEPDP VVHRMRNAWG ETRGSGERPT YAGWGGAQAD ATPYDQALRI GGQGFDTGIG VLANSRIEVR NAGHARFEAR VGVDDSTRNT KDKVRFSVYG DGQLLAQSPS MSLGEAPRSL TADIKGVRIV EIVARSETAT SDLPLVVTWG DAALRR
|
| |
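The gene sequence above is listed in coding orientation (it begins with ATG), so it can be translated directly even though the gene lies on the minus strand. The following is a minimal sketch, not part of the original record, of how the Gene sequence and Protein sequence fields relate. It assumes translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons), and the helper names `translate` and `gc_content` are hypothetical. Only the first 30 bases of the record are used as sample input.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard table; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon; '*' marks a stop."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the Gene sequence field.
prefix = "ATGACCGTCC GCTTCCTGCT GGCCTCGGCC"
print(translate(prefix))  # MTVRFLLASA
print(round(gc_content(prefix), 2))
```

The translated prefix matches the first ten residues of the Protein sequence field. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field, subject to rounding.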