Gene Information |
Locus tag | Caul_1142 |
Symbol | |
ID | 5898597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1212375 |
End bp | 1213574 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641561624 |
Product | integrase family protein |
Protein accession | YP_001682770 |
Protein GI | 167645107 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
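The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (both are assumptions, although both agree with the values in this record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1212375, 1213574)
print(length, protein_length(length))  # prints "1200 399"
```

The result matches the Gene Length (1200 bp) and Protein Length (399 aa) fields.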
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.441894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000269765 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGACGAT CTCTCCACAA ACTGACCGCG ATCCAAGCAG CGAAGCTGAA GGTGCCCGGT CGGCATTCCG ATGGCGGCGG TCTCTACCTT TCGATCGATG ACGGCGGGCG GCGACGCTGG GTGTTCATCT ACACACGCGG CGCCAAGCGC ACGGAATTGG GCCTGGGCGG CGGGCGCGAC CTGTCGCTCG CCAACGCGCG TGTCGAGGCG GCAGCGTTCC GGACTATGCT TGCGGACGGC CTCGATCCCA AGGTTGAGCG TGCGAGGGAC GACCAGGCGC AGACCTTCGG CGCCTGTGCT GATGCCTACG TCGAGGCGAT GCGACCGTCA TGGCGAAACG AGAAGCACGC CGCGCAGTGG AAGATGACCT TGACCAAGTA CGCGGCCCCT ATCCGTGAGC GACCGGTCAA GGAACTCACC ACTCAGGATA TCTTGGACGT GCTGCAACCC CTATGGACGC GCACGCCTGA AACCGCCGAA CGCCTACGTG GCCGGATGGA AAATGTGTTG GACGCTGCCA AGGCGAAAGG TTTGCGTACG GGCGAGAACC CGGCTCGCTG GCGCGGCCAC CTCAACCAGC TCTTACCCAA ACGTCAGAGA CTTGCGCGTG GCCATCATGC TGCCCTGGCC TACGATCTAA TCCCCGACTT CATGGCCAAT CTGCGTACGC GCAGTGCGGT CGCGGCGCGT GCGCTGGAGT TCGCGATCCT AACGGCTTCC CGTTCTGGCG AGGTGCTGGG CGCGACTTGG AACGAATTCG ATCTGGATAA AAAGGTTTGG GTCATACCGG CGACGCGCAT GAAGGCCGGA AGGGAACATC GCGTGCCGCT GTCGGCTCGC GCGATGGAAA TCGCTGAAGC GCAGTTTGAC GATGGCAAGG GCTACGTGTT TGCTGGGCCA AAGCTGGGAA AGCCTCTGTC GTCGATGGCG ATGGCGATGC TCCTTCGACG CATGAAGTCG GACATCACCG TCCACGGCTT TCGCTCCTCT TTCCGCGACT GGGCGTCCGA GACGACCGGC TTTTCGCATG AGGTTTGTGA GATGGCGCTA GCCCACACGA TCGCCAACAA GGCCGAGGCG GCTTACCGGC GCGGCGACCT GTTCGATAAG CGCCGTAAGC TCATGGAGGC TTGGGCGGGC TATTGCGCCA CGCAGAAAGC CGACCAAGTC GTGCAATTGC GCCGGGCGTC GGCGGACTGA
|
Protein sequence | MGRSLHKLTA IQAAKLKVPG RHSDGGGLYL SIDDGGRRRW VFIYTRGAKR TELGLGGGRD LSLANARVEA AAFRTMLADG LDPKVERARD DQAQTFGACA DAYVEAMRPS WRNEKHAAQW KMTLTKYAAP IRERPVKELT TQDILDVLQP LWTRTPETAE RLRGRMENVL DAAKAKGLRT GENPARWRGH LNQLLPKRQR LARGHHAALA YDLIPDFMAN LRTRSAVAAR ALEFAILTAS RSGEVLGATW NEFDLDKKVW VIPATRMKAG REHRVPLSAR AMEIAEAQFD DGKGYVFAGP KLGKPLSSMA MAMLLRRMKS DITVHGFRSS FRDWASETTG FSHEVCEMAL AHTIANKAEA AYRRGDLFDK RRKLMEAWAG YCATQKADQV VQLRRASAD
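The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before it can be compared with the protein sequence or with the GC content field. The following is a minimal sketch of that, assuming the amino acid assignments of translation table 11 (which are the same as those of the standard code) and ignoring alternative start codons, which is safe here because the gene starts with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tool.

```python
# Codon table built from the standard 64-letter amino acid string,
# with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGGACGAT CTCTCCACAA ACTGACCGCG"))  # prints "MGRSLHKLTA"
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the Protein sequence and GC content fields.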