Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0074 |
Symbol | |
ID | 5897786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 87536 |
End bp | 91186 |
Gene Length | 3651 bp |
Protein Length | 1216 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641560557 |
Product | hypothetical protein |
Protein accession | YP_001681710 |
Protein GI | 167644047 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGG GAATGGGGCG CGCGCGAATG TTCGGAAAAG TCGCCGCGTT CGAGTTTCGC TACCAGACGC GCCAGCCGGT CTTCTGGGTC GGGGTGATCA TCTTCTTCCT GCTCTCGTTC GCCTCGGTCG CAGCGCCTGA GTTCGTGCAG TTTGGATCGA CGGCCAATGT TCACAAGAAC GCCGCCTCGG TGATCGCCAG CGGCAACCTG AACCTGGCCC TGATCTACAT GTTCATCACG GCCGCCTTCG TGGCCAACAT CATCGTGCGC GACGAGGACA CCGGCTTCGA CGGCATCCTG CGCTCCACCC CGCTGACCAA GTTCGACTAT CTGTACGGCC GGTTCACCGG GGCCTTCGCG GCGGCGGCGC TGTCGTTCCT GGCCGTGCCC GCCGGCATGG CCCTGGGCGC GGTCATGTGG TGGGTGGACC CCGAGAGCGT CGGCCCGTTC GTCCTGAACC ACTACCTGTT CGCCTATTTC GTGCTGGCCC TGCCGTCGCT GCTGTTGACC TCGGCGGTGT TCTTCGCGGC GACCACGGTG ACACGGTCGA TGATGTGGAC CTATGTCGGC GTCATCGCCT TCATGGTGGC CTCGTTCATC AGTTCGACCC TGTTGCGCCA GCCCGGCCTG GAGAAGGTCG CCGCGCTCTG GGAGCCCTTC GGCGGCGGCG CCTATGGCCT CGCGATCCGC TACTGGACAG CGGCCGAGCG CAACACCCAG ATCCCGCCCC TGGCCGGCTA TCTGCTGGCC AACCGGCTGA TCTGGATCGG CATCGCCCTG GCGATCATCA CCGCCGCCTA CTGGCTGTTC GACCTGCGCA AGGGCGATGG GGCGACCCGG GTGAAGCGCG AGAAGCCGGG TCAGGCCGAG ACCCCGCGTC CGACCGGAGT CACGCACGGC GCGCCCCGCT TTGACGCGGC CTCGCGCTGG GCGCAGCTCT GGGCGCGGGC GCGGCTCGAC GCGCGGCAGG TGTTCGGCAG CCCGGCCTAT CTGGTGCTGG TCGGCCTGGC CGCCGCCCTG TCGGTGTTCA ACCTCTGGCG GGTCGCCGAT GACAGCCTCT ATGGGGGGGC GATCTATCCA GTGACCCGGG CGATGATCCA GGCCTTGAAC GGCATCTTTC CGTTCATTTC GCTGATCATC GCCTCGTTCT ACGCCGGCGA GCTGGTCTGG CGCGAGCAGG ACCGCAAGAC CCACGAGCTG ATCGACGTCA CGCCGATCCC CGACTGGGCC TTCGTCGTGC CCAAGACCCT GGCCATCAGT CTCGTCCTGC TGTCGACCTT CCTGGCCGGC GTGGCCGTCT CGGTGATCAT CCAGCTGATC AAGGGCTACA CCCACCTGGA GCTGGTGAAC TACATCGTCT GGTGGGTGCT GCCCCAGGCC GCCGACTGCA TCCTGATGGC TGTGCTGGCG GTGTTTATCC AGGTGCTGTC GCCGCACAAG TTCGTCGGCT GGGGTCTGAT GGTGCTCTAC ATCATCGCCC TGATCGTCGC CTCCAGCTGG GGTCTGGAGC ACAATCTGTA TCTCTACGAC GGCTCACCGA TGGTGCCGAT CTCGGACTTC AACGGGCAGA GCCGGTTCTG GATCGGCGCC TGGTGGTTCC GGCTCTACTG GACGGCCTTC GCGGTGGTGC TGCTGGTGCT GTGCCACGTG CTGTGGCGGC GCGGAACCGA GAGCCGGTTG CGCCCGCGAC TGGCCGCGCT GCCGCGCCGC CTGATGGGCG GGCCGGGCGT GATCGCGGGC GTCGCCCTCG TGGCGTTCAT CGGCGTCGGA GCGTTCATCT ACATCAACAC CAACGTCTGG AACGCGTACC GCACGACTCT GAGCAACGAG CGCTGGGCCG CCGACTACGA AAAGGCCCTG CTGCCGTTCG AGACCACGCC GCAGCCGAAG ATCGTCGCGG TCAAACTGGA CGTCGATATC CGGCCGGGCG CCCCGCGCAT CGACACCAAG GGCGTCTACG AGATCGAGAA CCGCACGGAC AAGCCGTTGC GCGAGATTCA CGTCCGCTTC GATCGCGACC TCAAGGTCCT GGCCCTGTCG ATCGAAGGCG CGCGGCCCAA ACGGACCTTC GAGCGGTTCA ACTATCGCAT CTTCGCCTTC GACACCCCGA TGCAGCCGGG CGAGCGGCGC TCGATGTCGT TCAGCACCGC CCGGGTGGCG ATGGGCTTCC GCAACAGCGG CGCCGACACC CTGATGGTAG ATAACGGGAC CTTCATCAAC GACGGCCAGC TGGCCCCGTC TCTGGGCATG GACCGCAACG GCCTGCTGAC CGACCGCAGC AAGCGGCGCA AGTACGGGCT GAAGCCCGAA CTGCGCCCGG CCAAGCTGGG CGACGTCGCG TCGCGCCAGT TCAACGGCCT GCGCCGCGAC AGCGACTGGG TGACTTCTGA CATCACCGTC ACCACCGACG CCGACCAGAC GCCGATGGCG CCGGGCTACA AGGTTTCGGA CACGACGTGG CATGAATCGG CCAACATGAG CGGCCCGGTC AACAAGGCCG GCGACCGGCG CACCGCGCGG TTCGTGACCG AAGCCCCGAT CCTGCATTTC TTCTCGATCC AGTCGGCCCG CTACGCGCTG AGGACCGAGC TCTACAAGGG CGTGCGTCTG TCGGTCTACT ACCACCCGGC CCACGCCTGG AACGTGGAGC GGATGATCAG CTCGATGAAG CGGTCGCTGG ACTACGACCA GGCCAATTTC AGCCCCTACC AGTTCCGTCA GTTGCGTTAT CTGGAGTTCC CGGCCTACGG CAATTTCGCC CAGTCGTTCG CCAACACCAT TCCCTGGTCC GAGAACCTGG GCTTCGTGTC CAAGTACGAG GACCCGACCA AGATCGACAT GGTCACCTAT GTCGGCGCGC ATGAGATCGC CCACCAATGG TGGGCGCACC AACTGATCGG CGCCGACCAG CAGGGCGGCG CGGCCTTGGC CGAGACCCTG GCTCAGTACT CGGCGCTGAT GGTGATGAAG AAGATCTACG GCGAGCCGAT GATCCGCAAA TTCCTCAAGT ACGAGCTGGA CCGCTATCTG CGGGCTCGGG GCGGCGAGGT GATCGAGGAG CTGCCGCTGC GCCAGGTCGA AGACCAGCCC TACATCTACT ACAACAAGGG CTCGCTGGTG ATGTACCGGC TGGCCAGCGA GATTGGCGAG GACAACGTCA ACGCCGCCCT GCGCGACATG CTGGCCGCCT ACGCCTTCAA GGGTCCGCCC TATCCGACCA CCCTGGAACT GGTCGCGGCC CTGCGCCGCC ACGCCCCCGC CGACAAGCAG GCCCTGATCA CCGACCTGTT CGAGAAGATC ACGCTGTACG ACCTGAAGAC CACGGCGGCG ACGGTGAAGA AGCGCCCCGA TGGCCGGTTC GACGTCACCC TGACGGTGAC GGCCAAGAAG CTCTACGCCG AGGGCCGCGG CCAGGAGAAG GAAGCGCCGA TGAGCGAGCC GATGGACATC GGCCTGTTCA CGCTCGAGCC GGGCAAGAAG GGTTTCGGGG CCGACAAGGT CGTGGCCTTC GAGCGCCGGA CCATCACGTC GGGGACCCAG ACCCTGGGCT TCGTCACGTC GGTCGCGCCG AAGGCGGCGG GGGTGGATCC CTACAACATG GTCATTGACC GCAATGGTGA CGACAACATC ACCAAGGTGG AGATGAGATG A
|
Protein sequence | MRMGMGRARM FGKVAAFEFR YQTRQPVFWV GVIIFFLLSF ASVAAPEFVQ FGSTANVHKN AASVIASGNL NLALIYMFIT AAFVANIIVR DEDTGFDGIL RSTPLTKFDY LYGRFTGAFA AAALSFLAVP AGMALGAVMW WVDPESVGPF VLNHYLFAYF VLALPSLLLT SAVFFAATTV TRSMMWTYVG VIAFMVASFI SSTLLRQPGL EKVAALWEPF GGGAYGLAIR YWTAAERNTQ IPPLAGYLLA NRLIWIGIAL AIITAAYWLF DLRKGDGATR VKREKPGQAE TPRPTGVTHG APRFDAASRW AQLWARARLD ARQVFGSPAY LVLVGLAAAL SVFNLWRVAD DSLYGGAIYP VTRAMIQALN GIFPFISLII ASFYAGELVW REQDRKTHEL IDVTPIPDWA FVVPKTLAIS LVLLSTFLAG VAVSVIIQLI KGYTHLELVN YIVWWVLPQA ADCILMAVLA VFIQVLSPHK FVGWGLMVLY IIALIVASSW GLEHNLYLYD GSPMVPISDF NGQSRFWIGA WWFRLYWTAF AVVLLVLCHV LWRRGTESRL RPRLAALPRR LMGGPGVIAG VALVAFIGVG AFIYINTNVW NAYRTTLSNE RWAADYEKAL LPFETTPQPK IVAVKLDVDI RPGAPRIDTK GVYEIENRTD KPLREIHVRF DRDLKVLALS IEGARPKRTF ERFNYRIFAF DTPMQPGERR SMSFSTARVA MGFRNSGADT LMVDNGTFIN DGQLAPSLGM DRNGLLTDRS KRRKYGLKPE LRPAKLGDVA SRQFNGLRRD SDWVTSDITV TTDADQTPMA PGYKVSDTTW HESANMSGPV NKAGDRRTAR FVTEAPILHF FSIQSARYAL RTELYKGVRL SVYYHPAHAW NVERMISSMK RSLDYDQANF SPYQFRQLRY LEFPAYGNFA QSFANTIPWS ENLGFVSKYE DPTKIDMVTY VGAHEIAHQW WAHQLIGADQ QGGAALAETL AQYSALMVMK KIYGEPMIRK FLKYELDRYL RARGGEVIEE LPLRQVEDQP YIYYNKGSLV MYRLASEIGE DNVNAALRDM LAAYAFKGPP YPTTLELVAA LRRHAPADKQ ALITDLFEKI TLYDLKTTAA TVKKRPDGRF DVTLTVTAKK LYAEGRGQEK EAPMSEPMDI GLFTLEPGKK GFGADKVVAF ERRTITSGTQ TLGFVTSVAP KAAGVDPYNM VIDRNGDDNI TKVEMR
|
| |