Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2447 |
Symbol | |
ID | 5135206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2601161 |
End bp | 2602954 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533899 |
Product | aminopeptidase P |
Protein accession | YP_001218347 |
Protein GI | 147674850 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAACC TACACTCTCA ACGCCTCGCG GATTTTCGCC ACTGGCTACA CACCCAGCAA CTTGATGCCT TCATTGTTCC TCATGAAGAT GAATATCTTG GGGAATACGT TCCTGAGCAT AATGAGCGCT TACACTGGTT AACTGGGTTT ACAGGTTCCG CGGGAGCCGC CATCGTTACC CTCTCCGGCG CAGCGATTTT TGTTGATGGC CGCTATACCG TTCAAGTCCG CAAACAAGTG TCTAGTGAGC TATTCGAATA TTGCCACTTG ATCGAACAAC CTTATTTAAA CTGGCTCGTC ACTCAACTGC CCGCAGGAGC GAAAGTGGGT TACGATCCTC GCATGCACCG TGGTAGCTGG CTAACACAGG CACAAAAACA GCTCGCAGGA AAAATTAACT TGTGTGCAGT CAGCAGTAAC CCGATTGATT TACTCTGGCA GGATCGTCCA GTTCCCGCAG CCTCTGAAAT GCGCTTAATA CCTCTTGATC GTGTTGGACA AAGCAGCCTT GAAAAGCGTC AATCGATCGC CAGCACTCTG CGCGACAAAA ACGCCGATTG CGTCGTGTTG ACCGAACTAG ATTCTATCGC ATGGCTACTC AATATCCGCG GTTTAGATGT ATCTCGCTTA CCCGTGTTGC TTTCTCACGC CATTGTCCAT AACGACAGCA GTGTGGATTT CTTTTTCGAT CCGGCGCGCT TGGCAACCGA TTTTGATGCC CATGTAGCAG GTACGGTTCG CGTGCATCAT CCGGATCAGC TTGAAGCTCA ACTGCATCAA CTCTCTGGTC GCCGCGTCAT GCTCGATTCT GCGACCAGCA ATGCATGGTT CACACTCACC TTACAAAATG CGGGCGCTGA ACTGCTCAAC GAAGCGGATC CATGCCTCCT GCCGAAAGCG GCAAAAAATA ACACGGAAAT TGCAGGCATG CGTGCCTGTC ATATTCGCGA TGGTGCAGCT ATGGTGCAAT TTCTCGCTTG GCTGGACAAT GAAGTGGCTA ATGGTCGCCT GCATAATGAG GCCGAGCTGG CGGATCGACT TGAAGCCTTC CGTCGCCAAG ACCCAACCTT GGTTGACTTG AGTTTTGATA CGATTTCAGC AGCTGGTACA AACGCCGCTA TGTGCCACTA CAACCACCAA AATCAGCCTG AGCCGGGTCA GCTTTCTATG GATAGCCTGT ATTTAGTCGA TTCTGGTGGC CAGTACCTTG ATGGAACCAC AGACATTACT CGTACGGTGG CCATTGGCCA GGTCAGCGCA GAGATGAAGC AACAATTCAC CTTAGTACTC AAAGGTCACA TTGCCTTGGC TCGCGCTCGC TTCCCGAAAG GCACAACAGG TTCACAACTT GATGTTTTGG CCAGACAGCA CTTATGGGCA CAAGGTTACG ACTACGATCA TGGTACTGGC CATGGTGTCG GCCATTTCTT AAGTGTACAT GAAGGGCCAC AGCGTATCGC GAAGGTTCAC AATAGTGTGG CTTTGCGCCC CGGAATGGTG CTTTCCAATG AGCCGGGTTA TTACCGTGCC GATGCGTTCG GCATTCGAAT CGAGAACTTA GAGCTGGTCA CCGAGTTTGC AACTCAGGGC GATTTCTCTG TACTTGGTTT TGAATCATTG ACTCGTTGCC CTATCGATAA ACGTGCGATT GAGGTGAATT TGCTGACTAA GCCGGAACTC CATTGGCTCA ATCAGTATCA TCAAAAAGTG TGGGATGAAG TGAGTCCGCT TATCAAAGAG GCTCACGTCC GTGAGTGGTT GCAACAAGCC ACCTCACCCT TGAGTCACGT ATAA
|
Protein sequence | MSNLHSQRLA DFRHWLHTQQ LDAFIVPHED EYLGEYVPEH NERLHWLTGF TGSAGAAIVT LSGAAIFVDG RYTVQVRKQV SSELFEYCHL IEQPYLNWLV TQLPAGAKVG YDPRMHRGSW LTQAQKQLAG KINLCAVSSN PIDLLWQDRP VPAASEMRLI PLDRVGQSSL EKRQSIASTL RDKNADCVVL TELDSIAWLL NIRGLDVSRL PVLLSHAIVH NDSSVDFFFD PARLATDFDA HVAGTVRVHH PDQLEAQLHQ LSGRRVMLDS ATSNAWFTLT LQNAGAELLN EADPCLLPKA AKNNTEIAGM RACHIRDGAA MVQFLAWLDN EVANGRLHNE AELADRLEAF RRQDPTLVDL SFDTISAAGT NAAMCHYNHQ NQPEPGQLSM DSLYLVDSGG QYLDGTTDIT RTVAIGQVSA EMKQQFTLVL KGHIALARAR FPKGTTGSQL DVLARQHLWA QGYDYDHGTG HGVGHFLSVH EGPQRIAKVH NSVALRPGMV LSNEPGYYRA DAFGIRIENL ELVTEFATQG DFSVLGFESL TRCPIDKRAI EVNLLTKPEL HWLNQYHQKV WDEVSPLIKE AHVREWLQQA TSPLSHV
|
| |