Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3199 |
Symbol | |
ID | 5900654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3459669 |
End bp | 3461498 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563704 |
Product | peptidase M1 membrane alanine aminopeptidase |
Protein accession | YP_001684824 |
Protein GI | 167647161 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.179758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.906786 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCCGA AAATCGCGTC GAGGGGGTTC GCTTTGCCTA TTCAGTTTCA CCGGCCGTTC CATCGGACCG CGGCCCTCTG CGCCGTCTCA TTGCTGGCCT TGACGGCCGT CGCTCACGCC GCCCAAGCCG AAAGTCCGGC CAAGCCGTCC CCGACCACCG CCTTGACCCT GGGCACCGGC GGCCAGATGC CCGCCGAGGA AGCGGCGGTG ACGCTTGAGC ACGTCGACCT GAAGCTCAAG ATCATCCCCG AGCGCAAGGC GATTGATGGC GACGCCACCC TGACCCTGGC GGCCAAGAGC CCCCTGCCCC GCATCGTCCT GGACTTCGAC AAGAACTTCA CGGTCAGCGC CCTGATCGTC GATGGCAAGA CCCTGCCCGC CTCGGCCTGG ACCAATCCGG AAGGCCGGCT GACCATCAAC CTCCCAGCCC CGATCGCCGC TGGCGCCAGG ACCGTGGTGC GGATCCGCTA TGCCGGCGCG CCGCACGAGG CCGAGAAAGC GCCGTGGGAC GGCGGCTTCG TCTGGAAGAC CACGCCGACC GGCGAGCCCT GGATCGCCAG CGCCGTGCAG GGCGACGGCT GCGACCTGTT CTGGCCGTGC ATCGACTTTC CGACCGGCGA GCCGCTTCTG GTCGATTTCT ACATCACCGT CCCCGCCCCG CTGTCGGCCC CCGCCGGCGG CGTGTTGGTC GGCGTCGAGG AGAAGGACGG CTGGCGCACC TTCCACTGGC GTCAGAAACA GCCCGACACC TACGCCATCG CGGTCAATGT CGGTCCCTAT GACAAGCTGG AAGCTGCCTA CAAGAGCCGG TTCGGCGACA GCTTTCCGAT CGAGTACTGG TATCTGAAGA GCGACGATCC GGCCAAGGCC AAGGCCCTGT TTGCCGAGTT TCCGACCACG CTGGACTTCT TCGAGCAGAT GATCGGCCCC TACCCGTTCC GGTCCGAGAA GCTGGGCGTC GTCGAGACCC CGCACCTGGG CATGGAACAC CAGACCATGA ACGCCTACGG CAACGAGTAC CGCAAGGACG TGTTCGGCTA CGACTGGCTG TTCCAGCATG AGCTGTCGCA CGAGTGGTTC GGCAACCAGG TGACCAATGT CGATTGGGAC GACATGTGGA TCCATGAAGG CCTGGGCAGC TACATGCAGC CGCTGTTCTC GCAGTGGCTG CACGGCGACA TGGAGTACAT GACGCGGCTG AACGCCCAGC GGGTCGGCAG CAAGAACCAG TTCCCGATCG TCTCCGACAA GGTGATGACC GAGGATCAGG TCTACAAACC CGAAGGCGGC CCGGCGAACG ACATCTACGC CAAGGGCTCG AACGTCATGC ACACCCTGCG GGCGACGATC GGCGACGAGG CGTTCTTCAA GTCGGTGCGC ACCCTGGTCT ATGGCCGCCC GGACCCCAAG CCCGGCAATT TCGCCCCGCG CTACGCCACG ACCAAGGACT TCATCGCGAT CGTCAACAGC GTAACCGGCA AGGACTATCA GTGGTTCTTC GACGCCTACT TCTACCAGGC CAAGCTGCCG GAACTGCGCG AGACCCGCGA CGGCGACGAT CTGGTGCTGA GCTGGAAGAC CCCCTCGGGC AAGGCCTTCC CGATGCCTGT CGAGGTCAAG GTCGGCGACA AGGTCGTCAC CGCCCCGATG GCCGACGGCA CGGGCCGGAT CAAGGTCGGC GACGCCGTGC CGGTGATCGT CGATCCCGCG TCCAAGATCC TGCGCCGCCA GCCCTATCTG GAAGACTATC AGGCCTGGAA AAAGGCCGCG GACGAAGCCG CCAAGAAGGC CGAAGAGGCC AAGAAGGCGG CGACCGCGAA GAAGTCGTAG
|
Protein sequence | MSPKIASRGF ALPIQFHRPF HRTAALCAVS LLALTAVAHA AQAESPAKPS PTTALTLGTG GQMPAEEAAV TLEHVDLKLK IIPERKAIDG DATLTLAAKS PLPRIVLDFD KNFTVSALIV DGKTLPASAW TNPEGRLTIN LPAPIAAGAR TVVRIRYAGA PHEAEKAPWD GGFVWKTTPT GEPWIASAVQ GDGCDLFWPC IDFPTGEPLL VDFYITVPAP LSAPAGGVLV GVEEKDGWRT FHWRQKQPDT YAIAVNVGPY DKLEAAYKSR FGDSFPIEYW YLKSDDPAKA KALFAEFPTT LDFFEQMIGP YPFRSEKLGV VETPHLGMEH QTMNAYGNEY RKDVFGYDWL FQHELSHEWF GNQVTNVDWD DMWIHEGLGS YMQPLFSQWL HGDMEYMTRL NAQRVGSKNQ FPIVSDKVMT EDQVYKPEGG PANDIYAKGS NVMHTLRATI GDEAFFKSVR TLVYGRPDPK PGNFAPRYAT TKDFIAIVNS VTGKDYQWFF DAYFYQAKLP ELRETRDGDD LVLSWKTPSG KAFPMPVEVK VGDKVVTAPM ADGTGRIKVG DAVPVIVDPA SKILRRQPYL EDYQAWKKAA DEAAKKAEEA KKAATAKKS
|
| |