Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1674 |
Symbol | |
ID | 5899129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1761257 |
End bp | 1763521 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641562164 |
Product | hypothetical protein |
Protein accession | YP_001683301 |
Protein GI | 167645638 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.204775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCGCG CGCCGCGATA TCGCGCCTTC ATCAGCTATT CGAGCGCCGA CCGTTCGACG GGCGAGGCCT TCCAGGCCGC GATCGAGCGC TTTCGCGTCC CCAAGCCGTT GCGGGGCCGG ACGACCGCCC GAGGCCTGGT TCCAAAGTCC CTGGCCCCGG TGTTTCGGGA CCGGTCAGAC GCCGACGCCA GCCAGAACCT TTCGGCGTTG TTGCTGGACG CCCTGGCGGC GTCCGAGGCC TTGATCGTCC TGTGCTCGCC CCTGGCGGCC CAGTCGAAAT GGGTCAATCG CGAGATCGCG GCGTTCAAGC GCCTGCGGCC CGGCGCGCCC GTCCTGCCGG TGATCGTCGG CGGGCGGAGC GGCCGATACG ACCCCGCGCT GCGGCCGGAA GGCGCCTTTC CCCCGGCGCT CTACGACCGG ATCGAACCGG ACGGCCAGAT GCTGGTCGGC GCCGAGCCGG AACCCCTGGC GCCGGATTGG CGCAAGACCG GAGACGGGCC GCATTTCACG GTCCTCAAGA TCGCCGCCGC CCTGACCGGC ATCCGGTTGA CCGAACTGAC CCAGCGCCAG GCCGAGGCCG AGCGGCGTGA ACGCAACATC GCCCGGGCGA TCGCGGCGGG CATGACCGTC CTGGCCCTGG TCGCCGTCGC CGGCGGCGTG CTGGCCTGGA TCGGCATGGA CGCCGCGCGC AAGCGGTTGT CCGACGTGGT CGACATGGCC GCCGCCCAGG TGCAGGACGC CGCCGATATC CGCGACAGCT ACGGCGTGCC GGGCGCGGTG ATACGCGACC TGCTCGACAA GAGCGGAACG CGGCTGAACG CCATCGTCGC CAAGGCGAAG GTCGATACGC CGCAAATGCG GCTGAGTCGG GCCGGCCTGG CCCAGGCCTA TGCGCAATAC GCGGCCGACG CGGCGCCGAC CCAGGAGCGC CAGGCCCGGG CGGCCTTGGC GGGGCTGGAC ACACTGGACC GCCCCAGCCT GCCCTATCGC CTCGCCCACC TGCTGGACGC GCCGACGGAC CCCCTGGCTC GCGCCCTGCT GCGCCAGGAG GCGCTGCGCA CACTGGCGGT CGCGCTCTCG GTCCAGCCCG GCAAGATGGC CGAGGCCGAA GCGGTGATGC GCCAGGCGCT GGAGGCGTCG GCGCGGATGG CGCTCGGTGA TCGCGAGCCG CAGACCGCCG AGAACTATCG GACCCTGGCC GGAATCCGCT ACCAGCGGTC GGGCGACTTG GCCAAGGCCC TGGAGGCGCA GGACAAGGCC CTGGCGGCGC TGAAGGACCA GCGCGAGGGC CAGGACCCCG AGGTCGACTA TCAGATCGCC GCCCTGCGCA GCGACCGCGC CGAAACCCTG CTCGAATTCC AGCGTATCCC CGAAGCCCTG GCCGAGCAGG AGCGCGCGGT CGCCCTGCTG CGACGCGTGG CGGCGCGCGA TCCGGGCGAC ACCCATTTCG CGGTCGCGCT GGCGGCGACC CTCGGGCGCG TCGCCGACGT GCGGGCCACG GCGCTCAATG ACGGGCCGGG CAGCCTGACC CTCTATGAGG ATGCCCTGGC CATGCAGGAG CGGCTACACG CCGGCGATCC CCTACGCACC GACTACATCC AGGACCTGAC CGCCACCCTC GAGCGCATCG TCGACGTGCT GGTCGAGGCG CCCCGCCAGG ACCTGGCGCG GATCACCGCC CTGCAGGAGC GCGTGGTCGC CCTGCGCCGG AGCCTCGTGG CCCGCGACGC GTCCGACGCG GGGCTGCGGC GCGATCTGGC GGTGGCGCTT GAGCGAGCCG GCGACGTCGC GGTGGAGCGG CATGATCTCG GCGCGGCGCG GCGGGCCTAT GACGAGTCCT TCGCCCTGCG GTCGGCGCTC CGCTCGGAGC GTGGAAACGA GGACGAGGCG CGGCTGGTGG CGACCCACGA CTACGCCCAG GCGCTGATCC GCCAGGGCGG GTTCGCGGCC TTGACCGGCC GACCCCTGCC GGAGATCGAG CCACGCTACC GCGCCGCCAT CGCCGAGATG ACGCCCTATC TGGGCGACTC GCGGTTCTCG CCGCAATGGC GCTTCGAGGT CGCCACCTGG GAGCTGGCCC TGGCCGACGT CCTGGCCAAG CGCGGCCGGT CGGCGGACGC CCTGACGCTG AGACGCGGCG CGGCGGCGCG TCTCGACGCG TTGGTCGCCG AGTATCCCGA CCAACCGATG TACGACCAAT GGCGCAAGGG GGTGCGCCGG CGTTTGGGGG CCGCGTTCAG AGGAACACAA GCCAAGGAGG GCTGA
|
Protein sequence | MGRAPRYRAF ISYSSADRST GEAFQAAIER FRVPKPLRGR TTARGLVPKS LAPVFRDRSD ADASQNLSAL LLDALAASEA LIVLCSPLAA QSKWVNREIA AFKRLRPGAP VLPVIVGGRS GRYDPALRPE GAFPPALYDR IEPDGQMLVG AEPEPLAPDW RKTGDGPHFT VLKIAAALTG IRLTELTQRQ AEAERRERNI ARAIAAGMTV LALVAVAGGV LAWIGMDAAR KRLSDVVDMA AAQVQDAADI RDSYGVPGAV IRDLLDKSGT RLNAIVAKAK VDTPQMRLSR AGLAQAYAQY AADAAPTQER QARAALAGLD TLDRPSLPYR LAHLLDAPTD PLARALLRQE ALRTLAVALS VQPGKMAEAE AVMRQALEAS ARMALGDREP QTAENYRTLA GIRYQRSGDL AKALEAQDKA LAALKDQREG QDPEVDYQIA ALRSDRAETL LEFQRIPEAL AEQERAVALL RRVAARDPGD THFAVALAAT LGRVADVRAT ALNDGPGSLT LYEDALAMQE RLHAGDPLRT DYIQDLTATL ERIVDVLVEA PRQDLARITA LQERVVALRR SLVARDASDA GLRRDLAVAL ERAGDVAVER HDLGAARRAY DESFALRSAL RSERGNEDEA RLVATHDYAQ ALIRQGGFAA LTGRPLPEIE PRYRAAIAEM TPYLGDSRFS PQWRFEVATW ELALADVLAK RGRSADALTL RRGAAARLDA LVAEYPDQPM YDQWRKGVRR RLGAAFRGTQ AKEG
|
| |