Gene Information |
Locus tag | Caul_0788 |
Symbol | |
ID | 5898243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 833646 |
End bp | 835013 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561269 |
Product | histidine triad (HIT) protein |
Protein accession | YP_001682417 |
Protein GI | 167644754 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases; [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
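
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 833646
end_bp = 835013
gene_length_bp = 1368
protein_length_aa = 455

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons. The last codon is assumed
# to be the stop codon, which does not appear in the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(remainder == 0)                                # length is a multiple of 3
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein
```

Under these assumptions the three fields agree with one another: 1368 bp is 456 codons, of which 455 encode amino acids.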
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.864692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.369294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCC AGATCGCCGC CGGCCTCGCC GCCCTGATGC TCCTTGGTCT CGCCGCGTCC GGCCAGACCG AGCCGCGTTT CGCCCCGATC CCCAAGGCGG TCGTGGCGGA CCCGGTCCGC GACGCCGCCC ATCCGGCCGA CATGGCCGCC TTCACGCTTC CCACCGGCGG CGTGCGGGTC AATGCGCTGA TGTACATCGC CTCGGGCGAC CAGCCGCATC CGACGATGCT GTTCCTGCAC GGCTTTCCTG GCAACGAGAC CAATATCGAC CTGATGCAGG CGGTGCGCCG GGCCGGCTGG AACGTGTTGA AGATCAACTA TCGCGGCTCG TGGGGCAGTC CGGGCAAGTT CTCGTTCGCG GGCGCCCGCG CCGACGGCGA GGCGGCTGTG GCCTTCCTGT TCGATCCGGC CAACATCGCC AAGTATCACA TCGACCCCAA GCGAATCGTG GTGGCCGGCC ACAGCATGGG CGGTTTCATG GCCGCCGACG CCGCGGCCGC CGAGCCCCGC CTGGCCGGAA CGGTGCTGAT CGACGCCTGG GACATCGGCA AGAACGCGGC CCAGATCACC AGCCCGGCCA CGCGCAAGGC GGCCGCCGAC AAGATGCGAC CCGACACCCG GCCCCTGGCC GGGACCAGCG CCGAATTGCT GGTCAAGGAG ATCGAGACGA ACGCCGCCAA GCTGGACCTC GAGGCGCTGA GCGCCCGGAT CGCCAACCGC CCTCTGTTGA TGGTTGGCGC CGAGCACGCC CGGGCGCCCA CGATCCGCAA GCTGGCCGCC GCGGCCCGTC AGGCCCAGGC CACGGCCCTG ACCGAAACCT ACATGGACAC CGACCACGGC TTCTCGGACC ATCGCATCGC GCTGGAGGCC GAGGTGGTCC GCTGGCTGGG CCAATTCGAT CCGGCCTCGG CCAAGCCCGG CACGCCGCGG ATCCCCCTGA AGGCGCCCTA TGACGAGGCC AACCCGTTCG CCAGGATCCT GCGCGGCGAG ATCGCCGTGC CCAAGGTCTA TGAGGACGAC CAGGTGCTGG CCTTCATGGA CTACGCCCCG GCCGAGCCGG GCCACGTGCT GGTGATCTCC AAGACCTCCA AGGCCCGCAA CCTCCTGGAG ATCTCGCCCC AGGACCTGTC GCGGATCATG GCCGTGGCCG CCCGGGTCGG CCAGGCCCAG GTCGATGGCC TCGGGGTCGA GGGTTTCACC ATCGTCCAGA ACAACGGCGT CGGCCAAAGC GTGCCGCACC TGCACATCCA CGTCATCCCA CGCGTGGCCG GCAAGCCGCT GATGTTCGTC GAGAACGAGA AGGGCGACCC CAAGGACATC GCGGCCATGG CCGACAAGAT CCGGTCGGCG ATGAAAGCTC CCCAATAG
|
Protein sequence | MKRQIAAGLA ALMLLGLAAS GQTEPRFAPI PKAVVADPVR DAAHPADMAA FTLPTGGVRV NALMYIASGD QPHPTMLFLH GFPGNETNID LMQAVRRAGW NVLKINYRGS WGSPGKFSFA GARADGEAAV AFLFDPANIA KYHIDPKRIV VAGHSMGGFM AADAAAAEPR LAGTVLIDAW DIGKNAAQIT SPATRKAAAD KMRPDTRPLA GTSAELLVKE IETNAAKLDL EALSARIANR PLLMVGAEHA RAPTIRKLAA AARQAQATAL TETYMDTDHG FSDHRIALEA EVVRWLGQFD PASAKPGTPR IPLKAPYDEA NPFARILRGE IAVPKVYEDD QVLAFMDYAP AEPGHVLVIS KTSKARNLLE ISPQDLSRIM AVAARVGQAQ VDGLGVEGFT IVQNNGVGQS VPHLHIHVIP RVAGKPLMFV ENEKGDPKDI AAMADKIRSA MKAPQ
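
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below, which uses only the first 30 bases and first 10 residues from the record, shows one way to do that, compute GC content, and check the translation. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one written in NCBI's TCAG order; translation table 11 is assumed to use the same codon-to-amino-acid assignments and to differ only in which start codons it permits.

```python
def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein.
gene_excerpt = clean("ATGAAACGCC AGATCGCCGC CGGCCTCGCC")
protein_excerpt = clean("MKRQIAAGLA")

print(len(gene_excerpt))
print(gc_fraction(gene_excerpt))
print(translate(gene_excerpt) == protein_excerpt)
```

To check the whole record, the complete "Gene sequence" and "Protein sequence" values can be pasted in place of the excerpts. The gene sequence is already given in coding orientation (it begins with ATG and ends with TAG), so no reverse complement is needed even though the gene lies on the minus strand.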