Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2785 |
Symbol | |
ID | 5900240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3021684 |
End bp | 3023864 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641563277 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_001684410 |
Protein GI | 167646747 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.457375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00294021 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCGAGGA CCGCGACGCC GGGCAAGACC GACGGCGGCT GGGCGGCGCG TCTTAGCACG CGGATTCGCG CCCGTCCTAG TCCGGCGCGG GTTCTGGCCG CGCTGATCGG CGAGATCGAC GCCAACCGCG AGCGCTGGAT GCTGTGGTCG CCGGTGGCGT TTGGCCTCGG CGCGGCCGGC TATCTGGAAC TGAGGACCGA GCCGTCATGG GCGCTGCTGG TCGGGCTAAC GGCGGGGCTG GCGATCCTGG CGGGGCTGAT CGGACGTCGC TCAACGCGGG GCTTGTCGGT GTTTCTGGTG CTGGCCACCC TTCTGGCGGC TGGAGCCTTG GCGGGCAAGG TGCGCTCGAA CGCCGTGGCC GCGCCGATTT TGGTTGGTGA GCGGGCGGTG ATGACCCTCG ACGGCTTCGT GGTCGATGTG GTCAGCCCCG GAGCGGGCGG GCCGCGCCTG CTGATCGCGC CCGTCGAGAT CAGCCGCCTC ACGCCGGAGG CCACGCCCAA GCGGGTGCGG GTGACGGTCG AGGCCGAGGA TATCCCGGTG CCGGGCCAGG CGATCCGCCT GCGGGCCATG CTGGGGCCGC CGCCGCCGCC GGCCGCGCCA GGAGCCTATG ACTTCGCCCG CGACGCCTGG TTTCATGGCG TGGGCGGGGT GGGCTTCGCG ATCGGCGAGT CGCGGCCCGT GGTATTGGAT CCGCCGCCGT GGCGGTTGCG GGCGGCCATG GCGGTCAACG CCTTCCGCTG GCGGCTGGCC AGCCGGATCG TCGCGCGGAT GGGCGCCGAG CGCGGCGGCG TCGCCGCGGC CATGGTCACC GGCCACGAGG CCTGGATCAC CCAGGAACAG ACCAACGCCA TGCGCGCCTC GGGCCTGGCC CACATCCTGT CGATCTCGGG CCTGCACATG GCGATCGTAG GGGGCTTCCT GTTCGGACTG GTGCGGCTAG GGATCGCCGC CTGGCCGTGG GCGGCGCTGC GGGTTCCGGG CAAGAAGGTC GCGGCCTTGG CGGGTTTGGC GGCGATCGGG ACCTATCTGG TGATCTCGGG CGCCCCGCCG CCGGCCCTGC GGGCGGCGAT CACCGCCACC GTGGCCTTCG CGGCCATCCT GTTTGACCGC CAGGCCATCA CCCTGCACGG CCTGGCGATC GCGGCGCTGG TCATCCTGCT CGTCCAGCCG GAGTCGGCGG GGGCTCCGGG CTTCCAGATG TCGTTCGCCG CCACCGCCGC CCTGGTCGCC TTGGCCGAGG CCTGGCCGCG GCCGGTGCGC GAGATCTCGG CGCCCTGGTG GATCCGGGCC ATTCAGGGCT CCATGAGCTG GCTGGCGGTC AGCATCGGGG CCAGTTTCGT GGCCGGCATG GCGACGGGGC CGTTCGCCAT GCAGCACTTC AACCGCGTGG CGGTCTGGGG CCTGCCGGCC AATCTGGCGG TCGCGCCGCT GTCGTCGTTC GTGATCATGC CGTTCCTGGC GATCGGCGCG GCGCTGGAGC CCTTCGGCCT GGGCGGACCG TTCCTGGCGA TCGCCGGCTG GGGCATCGGG GCGATGATGT GGATCGCCGA CGGCTTCGCC AGCGCCGGCG GAGCGCAGCG GCTGGTCGCC AGCGGGCCGC CCTTCACCCT GGCCCTGGCG TTCGTGGGAT TGATGCTGCT GTGCCTGTGG CGCGGCCGCC TGCGCTGGCT GGGCGCGCCC CTGGCCCTGG CGGTGGCCCT GTGGCCGCGC GCGGCGCCGC CCGATGTCTG GATCGCCCCG GACGGCAGCA CGGCGGCCGT GCGCCAAGGC CGCGAGGCGG TGCTGCTGCG CCCCGACGCC CGGCGGTTCG GAGCCGAGTT GTGGAGTCGT CGACGCGGCC TGGCCGCCCC GAGCGACACC AAGCCCAGCG CGCTCTACGC CTGCGACCGG CGATCCTGCG TACCGACCGC TGCCTCGCCG GTCGCCCTGG CCCTGTCCTG GAGCAAGGCC GCGCCGGACG CCGAGGCGTT GGCGGCGATG TGCGTCGAGG CCGAGATCGT CGTGCTGCGC GGCGCCGCGC CGGTCACGCC ACCGGCCTGC CGCGACCGGA TCGTGCTCGA CGCCGACGAT TTCGCCGTCG GCGGGGCGGT CGAGCTCTAT CGTCGGGGCG GCGACTGGTG GATCGTCTGG GCGCAGCCGT TGCGGGGCGT TCGGCCCTGG ACGCGCGCGG CGGGGAGCTA G
|
Protein sequence | MARTATPGKT DGGWAARLST RIRARPSPAR VLAALIGEID ANRERWMLWS PVAFGLGAAG YLELRTEPSW ALLVGLTAGL AILAGLIGRR STRGLSVFLV LATLLAAGAL AGKVRSNAVA APILVGERAV MTLDGFVVDV VSPGAGGPRL LIAPVEISRL TPEATPKRVR VTVEAEDIPV PGQAIRLRAM LGPPPPPAAP GAYDFARDAW FHGVGGVGFA IGESRPVVLD PPPWRLRAAM AVNAFRWRLA SRIVARMGAE RGGVAAAMVT GHEAWITQEQ TNAMRASGLA HILSISGLHM AIVGGFLFGL VRLGIAAWPW AALRVPGKKV AALAGLAAIG TYLVISGAPP PALRAAITAT VAFAAILFDR QAITLHGLAI AALVILLVQP ESAGAPGFQM SFAATAALVA LAEAWPRPVR EISAPWWIRA IQGSMSWLAV SIGASFVAGM ATGPFAMQHF NRVAVWGLPA NLAVAPLSSF VIMPFLAIGA ALEPFGLGGP FLAIAGWGIG AMMWIADGFA SAGGAQRLVA SGPPFTLALA FVGLMLLCLW RGRLRWLGAP LALAVALWPR AAPPDVWIAP DGSTAAVRQG REAVLLRPDA RRFGAELWSR RRGLAAPSDT KPSALYACDR RSCVPTAASP VALALSWSKA APDAEALAAM CVEAEIVVLR GAAPVTPPAC RDRIVLDADD FAVGGAVELY RRGGDWWIVW AQPLRGVRPW TRAAGS
|
| |