Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3500 |
Symbol | |
ID | 5900955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3779149 |
End bp | 3781848 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641564006 |
Product | hypothetical protein |
Protein accession | YP_001685125 |
Protein GI | 167647462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.512789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATCA CCACCAACGA AATGCTTGCC GATCTTGGCG GGCGCATTGT CGATCTCATC CAAGAGAATC TGACGCCCCC GCAGTTCATC ACCGCGCTTA GCGGTCTGGC GACCAATTGG GATGGTGGAA CACTGTCAGC CGTTGGTCTG GGGCGCCAAC TCTCCGACAG CATCGAAGCT CGTAACCGCT GGGTCGATCA GGCTTCCGCA TACTTCCAGG GCACCGCGAC AGGCGGACCA AACAGCGACG GCAAGTACCC GTTCACGACA CGCACAGGCG CGACCGTTTC GATGGAATGT CCAGCCAAAC TAGCAGCCAT GGTCACGGGC CCATCGGAGA GCGCTCAAGC CTATGCTGCA TCTGCTCTTG CTGCTCGCGA TGTCATTCTA GGGAAGGTCG CTGAAGCAGC GGCATCCGCG ACCGCAGCGG CAGGCAGCGC TACGGCGTCA GCAGGTAGCG CAACAGCGGC AGCAGCTAGC GCCACGACAG CCGGTACGGC GAAGACCGCT TCTGAGACAG CTCGCGATGT CACGCAGGGC TATCGAGACA CCACCCTAAC GGCCAAGACC GCGACCGAGA CGGCACGCGA TCTAACCCTC ACATACCGCG ACGGCGCACT AACGGCTAAG GACGCCGCGG TGACTGCTAA GACGGCTTCT GAGAGCGCGC GTGACGCAGC TATCGCAGCG GCATCGTCCG TTGATACCAC GGCCATCAAC AACAACCTGG CCCTCAAGTT CGACAAGGGC GGCGGGACGC TGACAGGCGA CCTCTACCTG ACGGCCAGCG GTTCGCAGTC TCCCGGCTTC CACCTCAAGG CAAACGGCAT GGGAACCGAC GCGAAGATCA GCCGCGCCTA CTCACAGGGC GGACTCTTCA CGTTGGACTT CGTGAACGAC GCCTACACGG CGGCGCAGCC ATTCCTGACC GTTGGACGCT CGGGCACCAC GCCTCAGGCA ATCAACCTGT TCGGTACATC GCTGAGCTTC AACAGCACGG CGATCACGAC CGCAACGACT CTGGCGAGCG GACTAGCGAC CAAGCAAAAC ACCCTCGGCT TCACACCGGT TCAACAAGGT ACAGGCGTCG GTCAAACCAC CAACGTAATC AAGTTGGGCT GGTCCAACGA GGGCAAGCTG AAGGCAACGA TCGATGCGAC TGACCAGGGC GCTATCGTGT TTGAAAGCGC GCTGACTTGG ACAAACCTAT CCGGCAAGCC GTCGAGCTTC GCGTCGGATT GGTCCACGCT CACAAGCAAG CCATCGACCT TTGCCCCATC GGCTCACACT CACGTCATTG CCGATACGAC CGGACTTCAG GCCGCGCTTG ATGCGAAGCT CGCGACCACA GGGTTCACCT ACACAGCGCT ACCGGGCAAG CCGTCGCTCT ACCCAACCGA CACGGCCAAC GTCTCAGGTC TGACAGCAGC CCTCGCACTC AAGGCAGACG CCTCGGCGTT GACTAGCGGT CTTGCCGCCA AGGCTCCAAT CGCCAGTCCG CAATTCACGG GCACGGCTAA CATCACAGGT GCAGCAGCAA CGACACGACT TTTTGGAGCT CAGACAGCTG GTGTCCTGCG TTGGATGTGG GGTGCGGCTG CAGACACCGA AAGCGGCTCC AATGCGGGTT CAAATTGGGC GCTCTACAGC TACGCGGACA ATGGCGCGTT TATCGGTACT CCGATCTCCG TGACCCGTGC GTCCGGCGCA GTTACGTTCG CGGGAGCCGC GACGTTCAAC AGCACCGTAT CCATCGGCGG CGCTACGCCT TGGACATCGG CCAACTTCAC GCCTTCCACC AAGCTCGATA CGTCGGCCTT CACGTGGGCA AACCTGAGCA GCAAACCAAC GACATTTGCG CCTTCGGCTC ACACTCACGC CACGTCTGAG ATTACCGGCT TAGATACTGC CCTTGCGGGG AAAGCTGCCC TGTCGGGCGC GACCTTCACT GGTGCAGTCG CCATGAACTC GACCCTCGCG GTCAGTGGAA ACGTCCGTGC GATTAGCACG GGCGGCACGG GACAGTTTAC CGCTGTCAGC GGCAATACAA TGGCCGGGTT CTACCAAGAC AGCACGAATT TCTACCTTCT GAAGAGCGCG ACTTCTAACA CCACGTTCGA CGGGCATCGA CCTATTGTGG TCAGCCTCTC GACGGGTTCG GTGACCATCG ACGGCACAGG CGCTGGCGGC ACTCAGGTTG GCGGTACATT GGGCGTCACC GGCACGCTCA ATTGCGCCGG TGAAATCTAC ACACCCGGCT GGATCAGGCT AACGGGCAAC CAGGGCATGT ACTGGAATGC CTGGGGCGGC GGTTGGACGA TGACCGACAG CACGTGGATG CGGTCATATG GCGACAAGTC CATCCTGACG GGCGGCAACA TCCAGTGCGC GATGTGGACC GTGACTTCCG ACGAACGCCT GAAGACCGAC ATCAAGCCGC TGACCAACGG CAGCGAGATC ATCTACGGAA CGAACGTCTA TTCGTTCATC AAGGGCGGTC AACGCATGTG GGGTGTCCTG GCTCAAGAAG CCCAGGCAAA TCCCCTCACT GAGGTTCTAG TCAACGAAGG CGGGCAGCTT CTACCAGACG GCAGCGGCAA CGCCCTGACC GTCGATAGCA TGGGCTACGT CTATGCCCTG ATCGACACGG TCAAGGAGCA GAACGCTCGC ATTGCGGCGC TAGAAGCGAG GCTCGCATAA
|
Protein sequence | MTITTNEMLA DLGGRIVDLI QENLTPPQFI TALSGLATNW DGGTLSAVGL GRQLSDSIEA RNRWVDQASA YFQGTATGGP NSDGKYPFTT RTGATVSMEC PAKLAAMVTG PSESAQAYAA SALAARDVIL GKVAEAAASA TAAAGSATAS AGSATAAAAS ATTAGTAKTA SETARDVTQG YRDTTLTAKT ATETARDLTL TYRDGALTAK DAAVTAKTAS ESARDAAIAA ASSVDTTAIN NNLALKFDKG GGTLTGDLYL TASGSQSPGF HLKANGMGTD AKISRAYSQG GLFTLDFVND AYTAAQPFLT VGRSGTTPQA INLFGTSLSF NSTAITTATT LASGLATKQN TLGFTPVQQG TGVGQTTNVI KLGWSNEGKL KATIDATDQG AIVFESALTW TNLSGKPSSF ASDWSTLTSK PSTFAPSAHT HVIADTTGLQ AALDAKLATT GFTYTALPGK PSLYPTDTAN VSGLTAALAL KADASALTSG LAAKAPIASP QFTGTANITG AAATTRLFGA QTAGVLRWMW GAAADTESGS NAGSNWALYS YADNGAFIGT PISVTRASGA VTFAGAATFN STVSIGGATP WTSANFTPST KLDTSAFTWA NLSSKPTTFA PSAHTHATSE ITGLDTALAG KAALSGATFT GAVAMNSTLA VSGNVRAIST GGTGQFTAVS GNTMAGFYQD STNFYLLKSA TSNTTFDGHR PIVVSLSTGS VTIDGTGAGG TQVGGTLGVT GTLNCAGEIY TPGWIRLTGN QGMYWNAWGG GWTMTDSTWM RSYGDKSILT GGNIQCAMWT VTSDERLKTD IKPLTNGSEI IYGTNVYSFI KGGQRMWGVL AQEAQANPLT EVLVNEGGQL LPDGSGNALT VDSMGYVYAL IDTVKEQNAR IAALEARLA
|
| |