Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2083 |
Symbol | pepA |
ID | 5137864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 2239728 |
End bp | 2241239 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640533539 |
Product | leucyl aminopeptidase |
Protein accession | YP_001217999 |
Protein GI | 147675524 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000000600102 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTTCA GTGTTAAGAG TGGCAGCCCT GAGAAACAGC GCAGCGCATG TATCGTTGTT GGGGTGTTTG AACCACGTCG CCTTTCTCCA GTCGCAGAAC AGCTTGATAA AATCAGCGAC GGCTATATTA GTTCACTGCT ACGTCGCGGT GATCTAGAGG GTAAACCGGG GCAGATGCTA CTGCTGCATC AAGTACCCGG TGTGTTGTCT GAGCGAGTAC TGCTCGTCGG TTGCGGTAAA GAACGCGAAC TGGGTGAACG TCAGTACAAA GAGATCATTC AGAAAACCAT CAATACCTTA AATGAAACTG GCTCTATGGA AGCAGTCTGC TTCTTGACCG AGTTGCACGT CAAAGGTCGC GATACCTATT GGAAAGTGCG CCAAGCGGTT GAAGCCACCA AAGATGGTCT GTACATCTTT GATCAATTCA AGAGCGTAAA ACCAGAAATC CGCCGCCCAC TGCGTAAATT GGTATTCAAC GTGCCCACTC GCCGTGAATT GAATCTTGGT GAACGCGCGA TTACCCATGG TCTGGCTATT TCATCAGGTG TAAAAGCTTG TAAAGATTTA GGTAATATGC CGCCCAACAT CGCTAACCCG GCTTACCTCG CCTCTCAAGC TCGTCGTCTG GCTGACGATT ACGAGAGCAT CACCACCAAA ATCATTGGTG AAGAAGAGAT GGAAAAGCTC GGCATGGCTT CTTACCTCGC GGTCGGTCGT GGCTCACGCA ATGAATCCAT GATGTCGGTC ATCGAATACA AAGGCAATCC AGATCCTGAA GCCAAACCCA TCGTATTGGT GGGTAAAGGT CTGACTTTCG ATTCAGGCGG TATCTCACTC AAACCGGGTG AAGGTATGGA TGAGATGAAG TACGACATGT GTGGCGCAGC ATCTGTTTTC GGCACCATGA AAGCCATTGC CAAACTCGGC CTACCACTTA ACGTAATTGG TGTGTTGGCT GGCTGTGAAA ACATGCCAGG CAGCAATGCT TACCGTCCGG GTGATATTCT GACGACGATG TCAGGTCAAA CCGTAGAAGT GTTAAACACC GATGCAGAAG GTCGTTTAGT TTTGTGTGAC GTACTGACTT ACGTTGAGCG TTTTGAGCCT GAATGCGTGG TCGATGTTGC AACGCTAACC GGTGCGTGTG TGATTGCTTT AGGCCATCAC ATCAGCGCGG TGATGTCGAA CCACAACCCA CTAGCACATG AGTTGGTGAA TGCCTCTGAG CAATCGAGCG ATCGCGCATG GCGTCTACCT CTGGCAGACG AATACCATGA GCAGCTCAAG AGCCCGTTTG CCGATATGGC AAACATTGGT GGCCGCCCAG GTGGCGCCAT TACTGCAGCT TGTTTCCTGT CTAAATTTGC TAAGAAATAC AACTGGGCAC ACTTAGACAT CGCAGGTACT GCATGGAAAT CCGGTGCCGC GAAAGGCTCA ACCGGTCGTC CTGTCTCACT ACTAGTCCAA TTCCTGCTTA ATCGCAGCGG TGGCCTAGAC GCTGAAGAGT AA
|
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP VAEQLDKISD GYISSLLRRG DLEGKPGQML LLHQVPGVLS ERVLLVGCGK ERELGERQYK EIIQKTINTL NETGSMEAVC FLTELHVKGR DTYWKVRQAV EATKDGLYIF DQFKSVKPEI RRPLRKLVFN VPTRRELNLG ERAITHGLAI SSGVKACKDL GNMPPNIANP AYLASQARRL ADDYESITTK IIGEEEMEKL GMASYLAVGR GSRNESMMSV IEYKGNPDPE AKPIVLVGKG LTFDSGGISL KPGEGMDEMK YDMCGAASVF GTMKAIAKLG LPLNVIGVLA GCENMPGSNA YRPGDILTTM SGQTVEVLNT DAEGRLVLCD VLTYVERFEP ECVVDVATLT GACVIALGHH ISAVMSNHNP LAHELVNASE QSSDRAWRLP LADEYHEQLK SPFADMANIG GRPGGAITAA CFLSKFAKKY NWAHLDIAGT AWKSGAAKGS TGRPVSLLVQ FLLNRSGGLD AEE
|
| |