Gene Information |
Locus tag | Acid345_2229 |
Symbol | |
ID | 4072974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2647198 |
End bp | 2649642 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984245 |
Product | aminopeptidase N-like |
Protein accession | YP_591304 |
Protein GI | 94969256 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
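The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are made up for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2647198
END_BP = 2649642
GENE_LENGTH_BP = 2445
PROTEIN_LENGTH_AA = 814


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 2445
    print(protein_length(GENE_LENGTH_BP))  # 814
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.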
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.325655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.574229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCC CCTTTGTGCG CCGTTGCGTC CTCTTTTTCT GCCTTGTAAG CTCCACCCTC TTCGCGCAAT TTGCCCCGAA TGCCCACCCG ATTTACCAGG CCTTGCGCAC CGGCGGGTTC GGGCAGGAAG CCTATTCCGT CCAGAACCTG CTGCTGAAGC GCGGTGGTGC GACGCTGAAG CTCGATGGCA AAATATGGCT GCTTGGCCCT GTGAATAACA AACATATCGG CTTGGCCTTC ACCGGCACCG GCATACTCTC GGTAACGCCG CCGACGCCCT CCGAGCAACG CCAACTTCGC CTTTTCTCAC GCGATGTCGA GTTCGTAGAG CATTTCGACG AGATGATGCT CTTCTTCACC GACAATACCT ACGACGAAGT CAAAGCCGCC GGACAGCCCG ACAGTTCTCC CGCGCCGGCT GGAGTTCTAA GCGACGCCCG CAAACGGCTC CGCGACTCGC TGCGTTACAA CCTTTACGGG CGTCTCCAGC AAGACGTGAT GGCACAGCAG CCCGGCGGGC TGTTCTTCGC GATGATCAAG GGCAAGCACT ACAGCGGCAA GATGCTGTTT GTAGTCGATC CGCATGGCGT GCCGTCGATC GATCCGCTGG AAGCGCCGCC ACTGGACGTA GCTCCCGAAG AAGTGGCCGT CACCGTCTAC GACGAAATGC ACGGCGGCGT CTGGACCGCC TACTACCTTC CGCAGGAGTA CATGGACCAC ACTGCGAAGG GCACGCAGAA GAACGACACG TTTTTCATTG AGCACCAGGA CATCGACGCC ACGATGGAGA AGAGTGGCAA GCTCGATGGC GTGGTGAAGA CCACGATCAT TTCCAACAAC GATGAACTGA ACGTCGTGCG CTTTGAACTT TTCACTACCT TGCGAGTTTC GAAGGTCGTT GACTCAAGTG GCAACGCCAT GCACTTCATG CAGGAGGACA AGGAACACGA CGCGCAGTTC TATGTCGTCT TGCCGCACGC GTTGAAGATG AACGAGAAGT TCGACATCGT TACCACCTAC AGCGGCAAGG ACGCGATCCG CAACGAAGGC GGCGACAACT ACTATCCCCT CGCCCGCGAT AACTGGTATC CCAACAAGCC GTTTGGCGAA TACGAAACCT ACGACATGAA ATTCCACATC CCCAAGGGGA TGACCATGGT TGCCACCGGC ATACCAGTGA GCAGCGGCGA AGAAAACGGC TGGGCTGTCT CGGTGTGGAA GAGCGCGGTG CCGCAGGCGG TGGCGGGGTT CAACTTCGGC CGATTCAAGA AGGACCAGGC TCAGCTCAAG ACGCGCAACA ATTTCCAGAT TGAGTCTTAC ACCAACATCA ACCCGCCCGA CATTGTGCAG GCGATCCAAC ATGCTGCCGA GCCCGGCATG TCACTGGATG GTTCCCACGG TTCCGCGGCT ACACTCGGCA CAATGGACAC CCGCTCGTTC GGGCCCAAGG CGCTTGCCGA GGCTGAGATG GCCACGGATC TGTATTGGCA GTACTTCGGA CCCATTCCGT ATCAGCGGCT TGCGATGACC CAGCAGACGG CCAGCAACTA CGGACAGGCG TGGCCCGGGC TGGTCTACCT GCCCATCACC TACTTCTTCG ACACCACGGT CCGACATCAG CTCGGTATGA GCGAGGCCAA GGGATACTTC CGCATCGTCG CCTCGCATGA GGTGGCGCAC CAGTGGTGGG GACACGCCGT CGGCTTTAAG TCGTATCGCG ATCAGTGGAT GAGCGAGGGC TTCGCGGAGT GCTCGGCTTC GATCTACACT CAGATGGTCA ATAAAAAGCC CGATGAGTTC 
CGCAAATTCT GGTCGGATGA ACACGAGCTA CTCGTGCAGA AGAACAAGGA AGGCGTGCGG CCGATCGACG TCGGCCCGGT CACGCTCGGA TATCGTCTGC TGAATGGCCG CACCGGCTAC GACGTGCCGC GGCGCCTGAT CTATCCGAAG GGCGCCTACA TCCTGCACAT GGTCCGCATG ATGATGTGGA ACTCGAAAAG TGGCGACGAA CTGTTCCAGA AGATGATGAC CGACTTCGTG CAGACCCATT ACAACAACGT GGCTTCCACT GAAGACTTCA AGACGGCGGT TGAGAAGTAC ATGACGCAGG GTATGGACGT GGACGGCAAC CACACCATGG ATTGGTTCTT CAACGAATAC GTCTACGGAA CGGCGCTGCC GGCCTATAGC TTCGAGTCTT CCTTTGTGGA TGGACCGAAT GGCACGACGC TGCTGAAGTT CAAACTGACG CAGTCAAAGG TGGACGATGC GTTCGACATG ATTGTTCCGG TTTACATCGA GTTGCCGAAT GGCCATGTGC CGAGGCTGGG CTCCATCGGC GTTCGTGGAA ACAACAGCGT TGAACAAACG GTGAATCTCG GGCCGCTGAA AGAACGGCCC AAACGCGCGC TCATTAATTG GAACTACGAC GTATTGTCGG AATAG
|
Protein sequence | MSIPFVRRCV LFFCLVSSTL FAQFAPNAHP IYQALRTGGF GQEAYSVQNL LLKRGGATLK LDGKIWLLGP VNNKHIGLAF TGTGILSVTP PTPSEQRQLR LFSRDVEFVE HFDEMMLFFT DNTYDEVKAA GQPDSSPAPA GVLSDARKRL RDSLRYNLYG RLQQDVMAQQ PGGLFFAMIK GKHYSGKMLF VVDPHGVPSI DPLEAPPLDV APEEVAVTVY DEMHGGVWTA YYLPQEYMDH TAKGTQKNDT FFIEHQDIDA TMEKSGKLDG VVKTTIISNN DELNVVRFEL FTTLRVSKVV DSSGNAMHFM QEDKEHDAQF YVVLPHALKM NEKFDIVTTY SGKDAIRNEG GDNYYPLARD NWYPNKPFGE YETYDMKFHI PKGMTMVATG IPVSSGEENG WAVSVWKSAV PQAVAGFNFG RFKKDQAQLK TRNNFQIESY TNINPPDIVQ AIQHAAEPGM SLDGSHGSAA TLGTMDTRSF GPKALAEAEM ATDLYWQYFG PIPYQRLAMT QQTASNYGQA WPGLVYLPIT YFFDTTVRHQ LGMSEAKGYF RIVASHEVAH QWWGHAVGFK SYRDQWMSEG FAECSASIYT QMVNKKPDEF RKFWSDEHEL LVQKNKEGVR PIDVGPVTLG YRLLNGRTGY DVPRRLIYPK GAYILHMVRM MMWNSKSGDE LFQKMMTDFV QTHYNNVAST EDFKTAVEKY MTQGMDVDGN HTMDWFFNEY VYGTALPAYS FESSFVDGPN GTTLLKFKLT QSKVDDAFDM IVPVYIELPN GHVPRLGSIG VRGNNSVEQT VNLGPLKERP KRALINWNYD VLSE
|
| |
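The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch using only the Python standard library: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard one (translation table 11 assigns the same amino acids to codons and differs only in which codons may act as starts), and only the first 30 and last 15 bases of the gene sequence are used so that the example stays short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 30 and last 15 bases of the gene sequence, spacing as shown in the record.
GENE_PREFIX = "ATGTCGATCC CCTTTGTGCG CCGTTGCGTC"
GENE_SUFFIX = "GTATTGTCGG AATAG"


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate(GENE_PREFIX))    # MSIPFVRRCV
    print(translate(GENE_SUFFIX))    # VLSE
    print(gc_fraction(GENE_PREFIX))  # 0.6
```

The translated prefix and suffix match the first and last residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would be the way to compare against the listed GC content; the value for the 30-base prefix alone is not expected to equal it.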