Gene VC0395_A2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2083 
SymbolpepA 
ID5137864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2239728 
End bp2241239 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content50% 
IMG OID640533539 
Productleucyl aminopeptidase 
Protein accessionYP_001217999 
Protein GI147675524 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000600102 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTTCA GTGTTAAGAG TGGCAGCCCT GAGAAACAGC GCAGCGCATG TATCGTTGTT 
GGGGTGTTTG AACCACGTCG CCTTTCTCCA GTCGCAGAAC AGCTTGATAA AATCAGCGAC
GGCTATATTA GTTCACTGCT ACGTCGCGGT GATCTAGAGG GTAAACCGGG GCAGATGCTA
CTGCTGCATC AAGTACCCGG TGTGTTGTCT GAGCGAGTAC TGCTCGTCGG TTGCGGTAAA
GAACGCGAAC TGGGTGAACG TCAGTACAAA GAGATCATTC AGAAAACCAT CAATACCTTA
AATGAAACTG GCTCTATGGA AGCAGTCTGC TTCTTGACCG AGTTGCACGT CAAAGGTCGC
GATACCTATT GGAAAGTGCG CCAAGCGGTT GAAGCCACCA AAGATGGTCT GTACATCTTT
GATCAATTCA AGAGCGTAAA ACCAGAAATC CGCCGCCCAC TGCGTAAATT GGTATTCAAC
GTGCCCACTC GCCGTGAATT GAATCTTGGT GAACGCGCGA TTACCCATGG TCTGGCTATT
TCATCAGGTG TAAAAGCTTG TAAAGATTTA GGTAATATGC CGCCCAACAT CGCTAACCCG
GCTTACCTCG CCTCTCAAGC TCGTCGTCTG GCTGACGATT ACGAGAGCAT CACCACCAAA
ATCATTGGTG AAGAAGAGAT GGAAAAGCTC GGCATGGCTT CTTACCTCGC GGTCGGTCGT
GGCTCACGCA ATGAATCCAT GATGTCGGTC ATCGAATACA AAGGCAATCC AGATCCTGAA
GCCAAACCCA TCGTATTGGT GGGTAAAGGT CTGACTTTCG ATTCAGGCGG TATCTCACTC
AAACCGGGTG AAGGTATGGA TGAGATGAAG TACGACATGT GTGGCGCAGC ATCTGTTTTC
GGCACCATGA AAGCCATTGC CAAACTCGGC CTACCACTTA ACGTAATTGG TGTGTTGGCT
GGCTGTGAAA ACATGCCAGG CAGCAATGCT TACCGTCCGG GTGATATTCT GACGACGATG
TCAGGTCAAA CCGTAGAAGT GTTAAACACC GATGCAGAAG GTCGTTTAGT TTTGTGTGAC
GTACTGACTT ACGTTGAGCG TTTTGAGCCT GAATGCGTGG TCGATGTTGC AACGCTAACC
GGTGCGTGTG TGATTGCTTT AGGCCATCAC ATCAGCGCGG TGATGTCGAA CCACAACCCA
CTAGCACATG AGTTGGTGAA TGCCTCTGAG CAATCGAGCG ATCGCGCATG GCGTCTACCT
CTGGCAGACG AATACCATGA GCAGCTCAAG AGCCCGTTTG CCGATATGGC AAACATTGGT
GGCCGCCCAG GTGGCGCCAT TACTGCAGCT TGTTTCCTGT CTAAATTTGC TAAGAAATAC
AACTGGGCAC ACTTAGACAT CGCAGGTACT GCATGGAAAT CCGGTGCCGC GAAAGGCTCA
ACCGGTCGTC CTGTCTCACT ACTAGTCCAA TTCCTGCTTA ATCGCAGCGG TGGCCTAGAC
GCTGAAGAGT AA
 
Protein sequence
MEFSVKSGSP EKQRSACIVV GVFEPRRLSP VAEQLDKISD GYISSLLRRG DLEGKPGQML 
LLHQVPGVLS ERVLLVGCGK ERELGERQYK EIIQKTINTL NETGSMEAVC FLTELHVKGR
DTYWKVRQAV EATKDGLYIF DQFKSVKPEI RRPLRKLVFN VPTRRELNLG ERAITHGLAI
SSGVKACKDL GNMPPNIANP AYLASQARRL ADDYESITTK IIGEEEMEKL GMASYLAVGR
GSRNESMMSV IEYKGNPDPE AKPIVLVGKG LTFDSGGISL KPGEGMDEMK YDMCGAASVF
GTMKAIAKLG LPLNVIGVLA GCENMPGSNA YRPGDILTTM SGQTVEVLNT DAEGRLVLCD
VLTYVERFEP ECVVDVATLT GACVIALGHH ISAVMSNHNP LAHELVNASE QSSDRAWRLP
LADEYHEQLK SPFADMANIG GRPGGAITAA CFLSKFAKKY NWAHLDIAGT AWKSGAAKGS
TGRPVSLLVQ FLLNRSGGLD AEE