Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3802 |
Symbol | |
ID | 5901264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4117831 |
End bp | 4120149 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641564324 |
Product | hypothetical protein |
Protein accession | YP_001685426 |
Protein GI | 167647763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0133094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00280043 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCCG AGAGCGCGCA AATCGAACTT CCGGCACAGC ACGAGCGTCA GGCGCACGGT CCTTGGAGCG ACCTAGCGCT CCATACCATC GGTTGGCGCG CGTTTCAGGA TCTCTGCTCG CAGGTATGCG AGGTCGTGCT CGGCCAGCCC GTGGAAATCT TCCGCGAAGC TCAGGACGGT GGGCAGGACG CGGTTTTTCT CATCCCCTCA GGAAGCGACG CGCCGCCGAT CGGTACGGTC CAGTGCAAGC ATACGTCGGA GGCCGCAAAG GCCCTGAAGG CGAGCGATCT CACCGCCGAG ATCGATAACG TCGAAGAGCT GGTGAAGGCC GGCCAGGCAG ACACCTACGC CTTCATGACC AATATGAGCG TGGATGCACC CGTCGCGGCC GCCATGCGCG CCCGGCTTCG CGCGCTTGGC GTGCGCAAGC CGCACATTCT CGGCCGCCAG TACATCGTTC GGGTCATCAA GAGCAGTGCG CGCCTTCGTG CGCTTGTCCC GCAGGTTTAC GGCCTTGGGG ATCTAACATC GATCGTCGAT GAGAGGCTCA GCGAACAGAG CCGTGCGCTG CTCGACAGCT GGATTCCGAA ACTCCGCACC TACGTTCCCA CTAAAGCCCA CCGCGACGCG ATCAACGCGA TTTCCAACCA TGGCGTGGTG CTGCTGCTCG GCAATCCGTC CAGCGGTAAG TCTGCGATTG GGGCCATCAT CTCGACAATC GCTTCGGAAA ACCCCGCCAA CACCGTCCTC GCTTTGACCA GCCCTCGCGA TTTCGAGGCG GGCTGGAATC CAAACGACCC GGGCCGCTTC TTCTGGATCG ACGACGCTTT CGGCTCGAAT GTGCTGCGCA ACGATTTTGT GCAGGACTGG ACGTCGGCCT TCTCGAAGCT GAGGGCGGCG ATCAAGCACG GTAACCGATT TCTCCTGACC TCCCGCAAGC ACATCTACGA AGCGGCGCGA CGCCGGCTGG GGCAGCGCAA CCTTGCGCAG TTCGCTGATG GTAGCGCTGT TGTCGATGTC GGCGAGCTGA TCTTTGAGGA GAAGGCGCAG ATCCTCTACA ACCATTTGAA TTTTGGCGAG CAGAGCCAAA GCTGGCGTTC AACCGTCAAG CCCCACCTCG CCGCTGTCGC TGCTGTTCGC GACTTCCTCC CCGGCATCGC CGAGCGTCTC GGCGACCCGA ACTTCACCAA GGGCTTGGCG CCGCGCGAAA GTTCCCTCGT TCGATTTATG GAGGAGCCGA CGGAACATCT GATCGACACC GTCAACGCCT TGGACGATCA GCTGCAAGCC GCGCTCATCC TTGTTTATGT TCACCAGACT GGGTTCGATC CTAGCGATTA CGATGCTTCG GCCGCACAAG CGGTCGCAGA ACTGACTGGC TACACTCTCA CCAAGATTCA GGATTGCTTC GCCGAGCTGA AAGGCTCGTT CCTGAAACTC TCCGGTTCAA AATGGACTTT CGCACATCCC ACGATCTCCG ACGCCCTGAC CGAGATTCTG CGCCAGAAGC CACATATGAT GGCGGCGCTC ATAAGGGGCG CGACTATCGA CACCATTCTC AGCAGTTTTA CGTGCGAGGG GTCGCCTCTT ATTCGAGACG CCCTTCTCAT ACCCGCAACA CTCGACGACG CTTTGGTCGC TCGGCTCGGC CACACACCAG ATGAATGGCA CCGCAATTGG ATGCTGTTCC ACTTTTTGTC TTATCGCGCC AACGAACACG TGTTCGTCAG TGCAGTTCAA CAATTTCCGC AACTACTTCG GCGGTCCTCC TGGGACTCCG ATCTGGTTAG CAACGATCCT CATGTCGCCA CATACGCGCG TGCCCATCGC CTCAACCTGT TGCCCGACGA CCTGCGCTCG GAGGCGGCGA ACAAGCTAGA ATCTGCCGTC CTCAACGATC TCGACGTCTC CTTCTTCGAC GAGCCGGAGA TGTTGGCTCT GATACCGCCG CTGAGCCTTA TCGGCGTTGG CTTGGCGTTG CGGACGACTG TGCTGCCGTC GCTTGAAGAG CGGATCGCCG AGATCGCCGC GGATGCCGAT CTGGACGAAG AGCCTGACAG CCACTTCAAG AAGCTTCTCG GCGTGCTTGA TTGCGTAGAG GCGATCGGCA TCGACGCGGA CTCCACCGTC TTGATCGATG ACACACGCGA TCAGGTGAGA CGGTCAATCA AGGCACTTGA AGAGCGCAAG CGGGAGCGCG ACGAAGAGTC CGACGACGAC ACAGATTGGA CTCACATCGT AACGCAGAAG AAGGATGATA CCCCCGCGCC ACCTGCTGCC GCCACGAAGC GCTCAGTGTT CGATGATGTC GATAAATAG
|
Protein sequence | MTAESAQIEL PAQHERQAHG PWSDLALHTI GWRAFQDLCS QVCEVVLGQP VEIFREAQDG GQDAVFLIPS GSDAPPIGTV QCKHTSEAAK ALKASDLTAE IDNVEELVKA GQADTYAFMT NMSVDAPVAA AMRARLRALG VRKPHILGRQ YIVRVIKSSA RLRALVPQVY GLGDLTSIVD ERLSEQSRAL LDSWIPKLRT YVPTKAHRDA INAISNHGVV LLLGNPSSGK SAIGAIISTI ASENPANTVL ALTSPRDFEA GWNPNDPGRF FWIDDAFGSN VLRNDFVQDW TSAFSKLRAA IKHGNRFLLT SRKHIYEAAR RRLGQRNLAQ FADGSAVVDV GELIFEEKAQ ILYNHLNFGE QSQSWRSTVK PHLAAVAAVR DFLPGIAERL GDPNFTKGLA PRESSLVRFM EEPTEHLIDT VNALDDQLQA ALILVYVHQT GFDPSDYDAS AAQAVAELTG YTLTKIQDCF AELKGSFLKL SGSKWTFAHP TISDALTEIL RQKPHMMAAL IRGATIDTIL SSFTCEGSPL IRDALLIPAT LDDALVARLG HTPDEWHRNW MLFHFLSYRA NEHVFVSAVQ QFPQLLRRSS WDSDLVSNDP HVATYARAHR LNLLPDDLRS EAANKLESAV LNDLDVSFFD EPEMLALIPP LSLIGVGLAL RTTVLPSLEE RIAEIAADAD LDEEPDSHFK KLLGVLDCVE AIGIDADSTV LIDDTRDQVR RSIKALEERK RERDEESDDD TDWTHIVTQK KDDTPAPPAA ATKRSVFDDV DK
|
| |