Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3592 |
Symbol | |
ID | 5901047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3876652 |
End bp | 3878598 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564102 |
Product | peptidase M1 membrane alanine aminopeptidase |
Protein accession | YP_001685217 |
Protein GI | 167647554 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.757001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTAG ACGGAACCGT CCTGCCCATG CCCTCTCTCA AGACCACCAC CCGCGCCGCC CTGCTGGCGC TCCTGCTCTG CGGCGCGGCC GCCGCGCCCG TCCTGGCGCA AACCGCTCAA CCCCCGATCC CGGCCATCCT GATGACGCCC GAGGCCCGCG ACATCCACTC CTACGCCCAG CCGCTGGTGG CCCGGGTCAC CCATGTCGAC CTGGACCTGA CCGCCGACTT CGCCGGCCAG AAGATGACCG GCACGGCCGC CCTCGACATC GCCGCCGCGC CGGACGCCGA GGAGGTGGTG CTCGACAGCA AGGGCCTGGT GATCCACGGC GTCACCGACG ACAAGGGCGC GGCCCTGCCG TGGACCCTGG GCAAGGCTGA CCCGATCCTG GGCGCGCCGC TGACGGTGCA GCTGCCCAAG GGGGCCGGAG CCGCCAAGCG CATCGTCATC AGCTATGACA GCGCCCCCGG CGGCGCGGCC CTGCAATGGC TGACCCCGGC CCAGACGGCG GGCAAGATCA AGCCCTATCT GTTCAGCCAG GGCGAGGCGA TCCTCAACCG CACCTGGATC CCCACCCAGG ACAGCCCGGG CGTCCGCCAG ACCTGGACCG CCCGCATCGT CGCGCCCGAG GGCCTCAAGG CCGTCATGAG CGCCGAGATG CTGACCCCCA ACGGCGAGCC CGTCGCTGGC GGCCGCGCCT ATCGCTTCAA GATGGACAAG CCAGTCGCCT CGTACCTGAT CGCCATCGCC ATCGGCGACA TCGCCTTCAC CCCGCTGGGC CAGCGGACCG GCGTCTACAC CGAGCCGTCG GTGATGAAGA AGACCGCCAA CGAACTGGTC GATGTCGAGA AGATGGTCGA GGCCGCCGAG AGCCTCTACG GCCCCTATGC CTGGGGCCGC TACGACCTGC TGGTCCTGCC GCCGTCGTTC CCGTTCGGCG GCATGGAGAA CCCCCGCCTG ACCTTCGCCA CGCCCACGAT CATCGCCGGC GACCGCTCGC TGGTCAGCCT GGTGGCGCAT GAGCTGGCCC ACTCGTGGTC GGGCAACCTG GTGAACAACG CCACCTGGTC GGACTTCTGG CTGAACGAGG GCTTCACCGA CTATTTCGAA AACCGGATCA TGGAGAAGCT CTACGGCAAG GACCGCGCCG ACATGCTGGC CGATCTGGGC TGGAGCGACC TGCAGGGCGC GATCAAGGAC GCCGGCGGGT TGAGCGGCGC CGACACCCGC CTGCACCTGG ACCTGACCGG CCGCGATCCC GACGACGGCA TGACCGACAT CGCCTATCAG AAGGGCGCGA CCTTCCTGCG CACCATCGAA AAGGCGGTCG GCCGCGCGCG CTGGGACGCC TATCTCAAGG CCTATTTCGC CCGGCACGCC TTCCAGAGCC AGACCACGGC CGGCTTCGTG GCCGACCTGC GCGAGAACCT GATCAAGGGC GACCCGAAGC TCGAAGCCGC GATCGGCATC GACAAGTGGG TCTATGACGT GGGGCTGCCG GACAACGCCG TGCACATCCA TTCTGCGGCC TTCCCGGCGG TGGACGCCTT GGCCGCCGCC TACGCCAAGG GCGGCCCCGC GCCGATCGCC AGGTGGAAGG CCTGGAGCAC GCCCGAGCGC ACGCGCTTCA TCGCCAGCCT GCCCCGCGCC CTGCCGAAGG CGCGCCTGGC CGCGCTCGAC AAGGCCTTCG GCCTGTCGGC CCAGGGCAAC AGCGAGATCC GCTTCGTCTG GCTGGAACTG GCCGTCGCCA ACCGCTACGA CCCCGCCATG CCGTCTCTGC AGGCCTTCCT GACCGACCAG GGCCGCCGCA AGTTCGTCGC CCCGCTGTTC AAGGACCTGA TGGCCCAGGG CGACTGGGGC CAGCCGATCG CCAAGGCGCT CTACGCCAAG ACCCGGCCGC TCTATCACGC GGTCACGCGC CAGACGGTCG ACGGGATCGT GAAATAG
|
Protein sequence | MSVDGTVLPM PSLKTTTRAA LLALLLCGAA AAPVLAQTAQ PPIPAILMTP EARDIHSYAQ PLVARVTHVD LDLTADFAGQ KMTGTAALDI AAAPDAEEVV LDSKGLVIHG VTDDKGAALP WTLGKADPIL GAPLTVQLPK GAGAAKRIVI SYDSAPGGAA LQWLTPAQTA GKIKPYLFSQ GEAILNRTWI PTQDSPGVRQ TWTARIVAPE GLKAVMSAEM LTPNGEPVAG GRAYRFKMDK PVASYLIAIA IGDIAFTPLG QRTGVYTEPS VMKKTANELV DVEKMVEAAE SLYGPYAWGR YDLLVLPPSF PFGGMENPRL TFATPTIIAG DRSLVSLVAH ELAHSWSGNL VNNATWSDFW LNEGFTDYFE NRIMEKLYGK DRADMLADLG WSDLQGAIKD AGGLSGADTR LHLDLTGRDP DDGMTDIAYQ KGATFLRTIE KAVGRARWDA YLKAYFARHA FQSQTTAGFV ADLRENLIKG DPKLEAAIGI DKWVYDVGLP DNAVHIHSAA FPAVDALAAA YAKGGPAPIA RWKAWSTPER TRFIASLPRA LPKARLAALD KAFGLSAQGN SEIRFVWLEL AVANRYDPAM PSLQAFLTDQ GRRKFVAPLF KDLMAQGDWG QPIAKALYAK TRPLYHAVTR QTVDGIVK
|
| |