Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3656 |
Symbol | |
ID | 4072259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4326083 |
End bp | 4327198 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985679 |
Product | aminopeptidase DmpA |
Protein accession | YP_592731 |
Protein GI | 94970683 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.950122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCC ACATCTTCTC TGCTCTCCTG CTAGCGTCAT CGCTGGTCTG CGCCCAGAAA CCTCGCGCCC GCGATCTTGG CGTTCCTTTC GACGGAACTC CTGGCAAGTT CAATGCGATC ACCGATGTCG CCGGCGTAAC AGTAGGCCAC AAGACGCTGA TCGAAGGCAC CGACATTCGC ACTGGCGTGA CCGCGGTGAT TCCGCGCGCG AACGATTTGT TTGATCCGGT GTATGCGGGA TGGTTTTCGC AGAACGGTAA TGGCGAGATG ACGGGAACGA CCTGGGTGGA AGAGTCGGGA TTCCTTGATG GCCCGGTAGT GATCACCAAC ACGCACAGTG TTGGTGTGGT GCGCGATGCG GTGATCGCAT GGCGCCTGAA CCACCAGCCG CCGAAAACGG TGGAGGATGC GTGGTCGCTG CCGGTTGTCG CGGAAACGTG GGACGGCTGG CTGAACGATA TCAACGGATT CCACGTGAAG CCGCAAGACG CGTTCGATGC GCTCGACAGT GCGAAGTCAG GCCCGGTGCT GGAAGGCGCT GTCGGCGGCG GAACGGGGAT GATTTGTAAC GAGTTCAAGG GCGGGATTGG GACGTCTTCG CGCATGCTCG ACGCCAAGGC CGGAGGCTAT ACGGTTGGGG TGCTGGTGCA GTGTAATTAC GGGCTGCGTC CGCAGCTGCG CATTGCTGGT GTTCCGGTGG GTAAGGAGAT TCCACAGCAC GCTGCCTATG AAGAAGAGAA AGGCTCGATC ATCGTCGTGG TTGCGACCGA TGCTCCGCTG ATTCCGACGC AACTCAAACG ACTTGCGCGC CGCGTAACCG ATGGGCTGGG GCGGCTGGGA AGCATCTCGG GCAATGGGTC GGGAGATATC TTCATTGCCT TCTCCACGGC GAATCCGCAC ACCTACTACA GCGAGAAGGC AGCGACGATA CAGACGCTGC CGCTCGACAA CATGGATCCG TTATTTGCAG CGACCGTGCA GGCGACGGAA GAAGCAGTGG TGAATGCGCT CGTCGCCGCA GAAGACATGA CGGGGTACAA GGGGCGGAAG GTGATCGCGC TGCCGCACGA TCAATTGCGA GAAGTGTTGA AGAAGTACAA CCGGCTGGAA AAGTAA
|
Protein sequence | MKAHIFSALL LASSLVCAQK PRARDLGVPF DGTPGKFNAI TDVAGVTVGH KTLIEGTDIR TGVTAVIPRA NDLFDPVYAG WFSQNGNGEM TGTTWVEESG FLDGPVVITN THSVGVVRDA VIAWRLNHQP PKTVEDAWSL PVVAETWDGW LNDINGFHVK PQDAFDALDS AKSGPVLEGA VGGGTGMICN EFKGGIGTSS RMLDAKAGGY TVGVLVQCNY GLRPQLRIAG VPVGKEIPQH AAYEEEKGSI IVVVATDAPL IPTQLKRLAR RVTDGLGRLG SISGNGSGDI FIAFSTANPH TYYSEKAATI QTLPLDNMDP LFAATVQATE EAVVNALVAA EDMTGYKGRK VIALPHDQLR EVLKKYNRLE K
|
| |