Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2391 |
Symbol | |
ID | 5899846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2599403 |
End bp | 2602267 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562882 |
Product | transcriptional regulator |
Protein accession | YP_001684016 |
Protein GI | 167646353 |
COG category | [R] General function prediction only |
COG ID | [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.814373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGGGACG CGCGCGCTTT GAGCCGGGGT CAGGCCTGCG ACGAAAGCCA AGCCGCCTTC GGACCCTACC TGTTGGCGCC CGGCCGGCGG CTGCTCACGC GCCAGGGCGT TCCGGTGGCG GTGGGCGCTC GAGCGCTCGA GATCCTGATC GCCCTGTTGA GCGAGCACGG CCGCGTGCTC ACCCACCGGC AACTGGTTGA GCGCGCCTGG GCGGGTCTGA CGGTCGAGGA GTCCAACCTG CGCGTGGCCG TCTCGGGCCT GCGAAAGACC CTGGGGGACG GCGAGGGCGG CGCCCGCTAT ATCGAGAACG TGGTGGGACG GGGCTACTGT TTCGTGGCCC CAGTCCGATG GCTGGAGGCC GAGCCGGCCG ACGCTTCGCC TTCGATCGTA CCCGCCAAGC TCCGCGGCGG TTCCACGCGT CTACCGATCC CCGCCGCGCG CGTCATCGGG CGCGAAAACG TCGTTGAAAA GCTCGTTGCG GCCCTGGGCG ATCGAGCGCT GGTCACGGTG GTCGGGGCGG GCGGACTGGG CAAGACGACC GTCGCGGCCC TGGTGGCGCA GGCCCTTAAG ACCCGAGCAG GGGTCGAGGC CGTCTTTGTC GATCTCAGCG CGACGGCGGA CGAGGCGGCC TTGACGAAGG CCGTGGCGGT CGCCCTGGGC CTGGACGGGC CTGGGCGGGA TTTGGAGGAG GCAATCCTTA CCTGCCTGGC GAACCATCGC GTGTTGCTCG TCTTGGACGG CTGCGAGCAT GTCGTCGAGG CGGTCGCGGC GTTTTGCGAC CGCCTCACTC GCCGGCTTGA TCTCTATCGG GTCCTGGCCA CCAGCCGCGA GGCGCTGCGG GTGGAGGCCG AGACCGTGCA CCTGCTGGAG CCGCTCGCCT TTCCGTCCGC CGATGGGGAT CCCTCGCTGG AGAGCCAGTT GGGCTATTCG GCGGTGCAAC TGTTCGTCGA ACGCGCGGAG GTCAGCGGTC GGCGTGGGCC TTGGACGCCG GTGGAAGCGC GGCTGGCGGG CGATATCTGC AGGCGGTTGG ACGGCGTGGC CTTGGCCATC GAACTCGTCG CCAGCCGCGT CGGCGCCTTG GGCCTGGCGG GCGTGGCCGA TCTGCTCGAC GATAGCCTTG GTCTCCAGTG GCCCGGCCGA CGTGACGCCG CGCCCCGTCA CCAGACGCTG CAAGCCCTGC TGGACTGGAG CCACGACCTG CTGGGCGAGC GCGACCGGCG CGTTCTGCGC CGCCTTTCGC TGTTCACAGG ACTGTTCACC CTGCGCGACG CCCACGCGGT GGCGGGCGAG CCGGGCGACG ACACGCTCGA CGTCGCCAAC GCCCTGACCA GCCTGGTCGA CAAGTCGCTG GTCTGGACCT CGCACGGCGG CGTCGGGCCG GTTCAGTTCA GGCTGCTGGA CACCACGCGT GTCTATGCCC TCGCGAAGCT GACGCGGAGC CAGGAAACCG ACGTGATCGC TGAGCGCCAC GCCGGGCAAG TTCTCGCGCG TCTGACGGCC GATGAGGTCA GCGACAACGC CTCGCTATTC CAGAGGGGCG CGCCCTTGGC CTCAGCCACG GTGGGCGACG TGCTCTCGGC GTTGCGTTGG GCTCAGGCCG CGCGTCCCCG GGACCTGTTC GTCAGCCTGG GCGTCGCCGC CGCGCCCTTG CTGTTGTCGC GCGCCATGCT CGACGAGTGC GAGGCCTGGT GCCTACGCGC GCTGACCGCC CTGCCGGACG AACGCCGCGG CGGCCTGGAA GAGCTCAGAT TGCTGGAAGC GGTGGGCGTC GCCCGCATGT TCGGACGCGG CAACCATGAG ACCGTGCGTG AAATCATCGA TCGCGGGATG GCGCTCGCGC AAGCGCTCGG CGCCACGACC GAACAGGTGC ACCTGCTGGC CGGCCAGCAT ATTTTCCTCA CGCGGATCGG CGACTTTTCC GCGGCGCTGG ACTGTGGGCG CCGATGCGCT GAGCTCTGCG ATCCGGTTCG GGATCCGAGC GGCGCGGCGC TGGCCCAGTG GATGCTGGGG ACCTCGATGC ATCTGGTGGG AGACCAACGC CTGGCGCAGG AAGCGGTGGA GCGGGGTTTC GGCTACTGGG CCGCTTCGGA CGCCGATGGC GCGGACTTCT TCGGATATGA TCACAAGGTG CGCGCGATGA TCGTGCTGTG CCGGGTCCTC TGGCTTCGCG GCGAACGCGC GCGATCGGCC GAGATGGCCG AGGCCGCCGT GGCCGAGGCC AAGCGTGGCG GCCGGCCGGT CAGCTTGTGC ATCGCGCTGA TCTATACGGC CACCGTGGCG TTGTGGGACG AGAATTGGGA CGAGGCCGAG CGTCTCGTAG CCCGGTTGAT CGAGCACGCC GCCCGACACG GGATCGGCCC CTATCACGGC GTGGGTCTGG CGCTGGAGGG GGAGCTGGAC ATCGGACGGG GGCATCTGCG CCAGGGCGTC GAAGCGCTTG GCCGAGCGCT CGAACGCCTC GATCGGGAAG CCCATCGTCT CCTTTCGCCG GCCTTCGCCG GAAGTCTCGC CAGGGGACTT CACGCGCTGG GAGAAGGCGC GAAGGCGCAG CGTCGCATTC TCGCGGCGCT GGACGAAACC AAGGCGGTTG GCGACCACTA TGATCTGCCC AGACTCTACC TGATCGCCAG CGAGATCGCT GGAGACGACC CTCGGGCTTC GAAGGCTTAC GCGATCCAGG CCCTTGAGGC CGCGCACGCG AGCTCGGCCC TGAGTCTGGA ACTGGCCGCG GCCATGAGGC TCTGGGCGCT GGAGGGCGAC AGCGCGGCCA CCCGTCAGCG CCTCGAAGGC CTGTGGAGGT TGGCGCCCGA TCACGACGCG CCCCTGGCGC GGCGCGCCCA CGCCTTATTG TGCGCTACGG CCTGA
|
Protein sequence | MGDARALSRG QACDESQAAF GPYLLAPGRR LLTRQGVPVA VGARALEILI ALLSEHGRVL THRQLVERAW AGLTVEESNL RVAVSGLRKT LGDGEGGARY IENVVGRGYC FVAPVRWLEA EPADASPSIV PAKLRGGSTR LPIPAARVIG RENVVEKLVA ALGDRALVTV VGAGGLGKTT VAALVAQALK TRAGVEAVFV DLSATADEAA LTKAVAVALG LDGPGRDLEE AILTCLANHR VLLVLDGCEH VVEAVAAFCD RLTRRLDLYR VLATSREALR VEAETVHLLE PLAFPSADGD PSLESQLGYS AVQLFVERAE VSGRRGPWTP VEARLAGDIC RRLDGVALAI ELVASRVGAL GLAGVADLLD DSLGLQWPGR RDAAPRHQTL QALLDWSHDL LGERDRRVLR RLSLFTGLFT LRDAHAVAGE PGDDTLDVAN ALTSLVDKSL VWTSHGGVGP VQFRLLDTTR VYALAKLTRS QETDVIAERH AGQVLARLTA DEVSDNASLF QRGAPLASAT VGDVLSALRW AQAARPRDLF VSLGVAAAPL LLSRAMLDEC EAWCLRALTA LPDERRGGLE ELRLLEAVGV ARMFGRGNHE TVREIIDRGM ALAQALGATT EQVHLLAGQH IFLTRIGDFS AALDCGRRCA ELCDPVRDPS GAALAQWMLG TSMHLVGDQR LAQEAVERGF GYWAASDADG ADFFGYDHKV RAMIVLCRVL WLRGERARSA EMAEAAVAEA KRGGRPVSLC IALIYTATVA LWDENWDEAE RLVARLIEHA ARHGIGPYHG VGLALEGELD IGRGHLRQGV EALGRALERL DREAHRLLSP AFAGSLARGL HALGEGAKAQ RRILAALDET KAVGDHYDLP RLYLIASEIA GDDPRASKAY AIQALEAAHA SSALSLELAA AMRLWALEGD SAATRQRLEG LWRLAPDHDA PLARRAHALL CATA
|
| |