Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4781 |
Symbol | |
ID | 5902243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5162388 |
End bp | 5164178 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641565301 |
Product | heparinase II/III family protein |
Protein accession | YP_001686399 |
Protein GI | 167648736 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.997993 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCG CCCCCGCGAC AGGTTCCGCC CGCAAGCCGA CCGCCCTCGC CGGCGGCCCC CAAGGGCGTG GCAAGGCCCA TGGTCGCGGC AAGGCGCCCA AGGGGCTGTT TTTCAAGGCC TTGGGCGCTT CGATACGCGG CAATCTCGAG CGCGAATGGT TCGGTTCGCC GCCGCATCGC GCCCTGATCA GCCGCCCGCG CCCCGTCGGC CTGGCCGTCC GTCCGCACGA CCCGCGACCG GTCGACATCG AGCGCGGCCG CCAACTGCTT GACGGCGTCA TGACCCTGGA CGGCGGCGCG CTGCGTCTGG GCGAGACCGG CGACCCCTTC GACCAGCCCA GCCCGACCCG TCAGTTCGCG ACCGCCCTGC ACGGCTTCGA CTGGCTGCCG CATCTGTTGG CCGCCGGTCC CGGCGCGCCA CGTCTGGCCT TGCGGCTGCT ACAGGACTGG CGGCGGGTGT TCGGGGTGTG GAACGCCTTT TCATGGAGCG GCGAGCGGCT GGAGCGCCGC ACCTTCCACC TGGCCTGCGC CGCCCGCGCC CTGTCGCCCG AGGCCTCCGA CGCCGAGATC TCGGCCATGA CCATGGACAT CGCCCGGGGC GCCCGCCAAC TGCTCAAGGC CTCCGACGCC CCCGACCGCC GGCTGGAACG CGCCGTCGTC GTGGCCATCG CCGGCTGCGC CCTGACCGGC AAGGCCAGCG ACCGGTTGAT GGCCGCGGGG CTCAAGCGGG TGGCGGCCGA TATCGAGAGC ATCGTCCTGC CCGATGGCGG CCACGCCAGC CGCTCGCCCG AGGCGGGGCT GGAGCTGTTG TTCGACCTGC TGACCCTCGA CGACGCCCTG GGCCAGCGCG GCCGCCCGGC GCCCGAGGCC CTGGGCCGGG CCATCGACCG GCTGAGCTCC TCGGTGCGGT TCTTCACCCT GGCCGACGGT TGCCTGGCGG CCTTCCAGGG CGGCGAGGCC GTGGAGCCGC GCCGGGTGGC CGCGGCCCTG GCCCACGACG ACCACGGTCC GCGTCCGCCG CAAAGCGCGC CGCATGTCGG CTATCAGAAG ATGCAGGGCG GCAGCATCCA GGTGATGGCC GACGCCGGCC CGCCGGCCAG AGGCGTGCTC AGCGTCTCGG CCTGCGCCCA GCCGGCCGCC GTCGAGATCG TCTGCGGCAA GGATCGGTTG ATCACCAGCT GCGGCTGGAG CCCGGAGGCC ACCGGCGCCA ACGCCTTCCG GCTGTCGGAC GCCGCTTCGA CCCTGTCGGT CGGCGACGGC TCTGCCGGAC GACCGCTGTC GGGCTGGCGA TCCGGCGCCC TGGGTCCCTG GCTGATCGAC GGCGCGACCG ATGTCGAGAT CAAGCGTCAC GACGCCGACG TCGGGGTGTG GCTGGACATC GTCCACGACG GCTGGCGCCG CCTGGGCCTG ACCCATGCCC GCCGCCTGTA TCTCGACCTC AAGGCCGATG AGCTGCGCGG CGAGGACAGC CTGATCCCGC TCCCAGACAA AAATGGCGTC TCGCCCCATG CCGACGGGCC GCGCCGCTAC CTGCCGTTCA TGATCAGCTT CCACCTGCAT CCGGACGCCC GCGCCTCGCT GGCCCGCGAT GGCAAGAGCG TGCTGATCAA GGGGCCGTCC AATGTCGGCT GGTGGCTGCG CAACGACGCC GTCGATGTCG CCATCGCCCC CTCGGCCCAT TTCGACCACG GTCACGCCCG GCGGGCGGGA ACCATCGTGC TCAGGAGCCA GGTGCGTCCC GAGAAGGGCG CCAAGATCCG CTGGAAGCTG GCCCGGGCGG CGGATCACTG A
|
Protein sequence | MDGAPATGSA RKPTALAGGP QGRGKAHGRG KAPKGLFFKA LGASIRGNLE REWFGSPPHR ALISRPRPVG LAVRPHDPRP VDIERGRQLL DGVMTLDGGA LRLGETGDPF DQPSPTRQFA TALHGFDWLP HLLAAGPGAP RLALRLLQDW RRVFGVWNAF SWSGERLERR TFHLACAARA LSPEASDAEI SAMTMDIARG ARQLLKASDA PDRRLERAVV VAIAGCALTG KASDRLMAAG LKRVAADIES IVLPDGGHAS RSPEAGLELL FDLLTLDDAL GQRGRPAPEA LGRAIDRLSS SVRFFTLADG CLAAFQGGEA VEPRRVAAAL AHDDHGPRPP QSAPHVGYQK MQGGSIQVMA DAGPPARGVL SVSACAQPAA VEIVCGKDRL ITSCGWSPEA TGANAFRLSD AASTLSVGDG SAGRPLSGWR SGALGPWLID GATDVEIKRH DADVGVWLDI VHDGWRRLGL THARRLYLDL KADELRGEDS LIPLPDKNGV SPHADGPRRY LPFMISFHLH PDARASLARD GKSVLIKGPS NVGWWLRNDA VDVAIAPSAH FDHGHARRAG TIVLRSQVRP EKGAKIRWKL ARAADH
|
| |