Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1425 |
Symbol | |
ID | 5898880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1513375 |
End bp | 1514715 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561912 |
Product | NHL repeat-containing protein |
Protein accession | YP_001683053 |
Protein GI | 167645390 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCAAAG CCACCCTGCT TTTCCTGGGA TCCACCCTGG TCCTGTCAGC CTGTGGGGGC GGCCCCGCCC TGCCGCCCGA GCAAGGCTAC GGCCCAAGTC CTGTCCTGCC CGCCGCCAAG AAGCAGGTTC TGCCGGTGAT GAAGATCGCC CCGGCGGTCG GGTGGGCTGA AGGGCAGACG CCTGAGACCG CACCCGGCTT CGTCGCCACC GCCCTGGTCC GCGGCCTGGA TCATCCGCGC TGGCTCTACG TGCTGCCAAA CGGGGACGTT CTGATCGCCG AGACCAACGC CCCACCCAAG CCCGAGGACG GCAAGGGCAT CCGGGGTTGG TTCCAGAAGA TAATCATGAA GCGCGCCGGA GCCACGCCCG CCTCGGCCAA CCGCATCACC CTGGTGCGCG ACGCCGACAA TGACGGAACG CCCGAGGCCC GCACTGTCTT CCTGTCGGGC CTGAACTCGC CGTTCGGCAT GGCCTTGGTG GGCGACACCT TCTATGTCGC CGATAGCGAC GCCCTGCTGG CCTTCCCCTA CCAGCCCGGC CAGACGCAGA TCACCGCCGC GCCACGCAAG GTCGCCGACC TCCCGGCCGG ACCGATCAAC CACCACTGGA CCAAGAACGT CATCGCCAGC CCCGACGGAT CCAAGCTTTA TGTGACGGTC GGCTCCAACA GCAATGTCGG CGAGAACGGC ATGGCGAACG AGGAGCGCCG GGCCGGCATC CTGGAGATCG ACCCGGCCAC CGGCGCCAGC CGCGTCTTCG CCTCGGGCCT GCGCAATCCC AACGGCATGG GCTGGCAGCC CCAGAGCGGC AAGCTATGGA CCAGCGTCAA CGAGCGCGAC GAGATCGGCA ACGACCTGGT CCCCGACTAC ATGACCTCGG TCCAGGACGG CGGCTTCTAC GGCTGGCCCT ACAGCTACTA CGGCCAGACG GTGGACACGC GGGTCAAGCC GCAAAGGCCC GATCTGGTGG CCAAGGCGAT CAAGCCCGAC TACGCCCTGG GCGCCCACAC CGCTTCGCTG GGCCTGACCT TCTACGACGC CGACGCCTTC CCGGCTCGCT ACAAGGGCGG CGCCTTCATC GGCCAACACG GCTCGTGGAA CCGCAAACCG GTCAACGGCT ACCGGGTGGC CTTCGTGCCG TTCGCGGGCG GTGTTCCCGC GGGTCCGGCC GAGCCGTTCC TGGTCGGGTT CCTCAACGCC AAGGGCGAGG CTCTCGGCCG GCCAGTGGGC GTGGCGGTCG ACAGGACCGG CGCCCTGCTG GTGGCCGACG ACGTCGGCAA TGTGGTCTGG CGGGTGGCCG CCAGCGCCCC GCCGACGCCG GCCGCGAAGC CCGCGCCCTG A
|
Protein sequence | MLKATLLFLG STLVLSACGG GPALPPEQGY GPSPVLPAAK KQVLPVMKIA PAVGWAEGQT PETAPGFVAT ALVRGLDHPR WLYVLPNGDV LIAETNAPPK PEDGKGIRGW FQKIIMKRAG ATPASANRIT LVRDADNDGT PEARTVFLSG LNSPFGMALV GDTFYVADSD ALLAFPYQPG QTQITAAPRK VADLPAGPIN HHWTKNVIAS PDGSKLYVTV GSNSNVGENG MANEERRAGI LEIDPATGAS RVFASGLRNP NGMGWQPQSG KLWTSVNERD EIGNDLVPDY MTSVQDGGFY GWPYSYYGQT VDTRVKPQRP DLVAKAIKPD YALGAHTASL GLTFYDADAF PARYKGGAFI GQHGSWNRKP VNGYRVAFVP FAGGVPAGPA EPFLVGFLNA KGEALGRPVG VAVDRTGALL VADDVGNVVW RVAASAPPTP AAKPAP
|
| |