Gene Veis_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1923 
Symbol 
ID4691340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp2157709 
End bp2159748 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content65% 
IMG OID639849690 
Productoligopeptidase A 
Protein accessionYP_996694 
Protein GI121608887 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0492523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACC CCCTTCTGAC CCCCTCGGAC CTGCCCCCCT TCGACCGCAT CGGCGCGCAG 
GATGTCGCGC CCGCCATCGA TGTGCTGCTG GAGCGCGCCA GCCAGGCACT GGAGACCGTC
ACCGCCCCCG GCTTCCCGGC CCGCTGGCAA GCCATCATGG AGCAGTTGGA AGTGCCCAAC
GAGCAACTGC TCCGGGCCTG GGGCGCCATC AGCCATTTGA ACAGCGTGGC CGACACCCCG
CCACTGCGGG CCGCCTACAA CGCCGCCCTG CCGCGCATCA CCGAATTCTG GACCCGCCTG
GGCACCAACG AGCGGCTGTA CGCCAAGTAC AAGGCCATCG ACCCCGCGTC ACTGACGCCC
GTGCAAAGCC AGGCCCACCG CAACGCCATG CGCAACTTCG TGCTCGCCGG CGCAGAACTG
ACCGGGGCCG CCAAAGAACG CTTCGCCTGC ATCCAGCAGC GCCAGGCCGA ACTGGACCAG
AAATTCAGCG AAAACGTCCT CGACGCCACC GAAGCCTTTG CCTATTACGC CAGCGCCGAA
GAACTCGACG GCCTGCCGGC CGACATACGG CAAGCAGCCC TGTCCGCCGC GCAGGCCGAG
GGGCGGGCGG GCTACAAACT GACACTCAAA TGGCCCTGCT ACCTGCCGGT AATGCAATTT
GCCAGCCGCA GCGAACTGCG CGAAAAACTC TACCGCGCCT ATGTCACCCG CGCCTCCGAC
CAGGCCGAGA ACCGACAGTT CGACAACAGC GCATCGATCG CCGAGATTCT CGCGCTGCGC
CGCGAAGAAG CCCGACTGCT TGGCTACCCC CATTTCGGCG CGCTGTCGAT CGCGCCCAAA
ATGGCGCAGT CGCCCGCCGA AGTGATCGGC TTTTTGCGCG ACCTGGCGCG CCGGGCGCGC
CCCCATGCAG AAAAAGACCT GGCCGACCTG CGCGCCTTTG CGGCCAGGCA CAAGGGCCTG
GACGACCCCC AGGCCTGGGA CTGGTACTAC CTGTCCGAGC AGCTCAAACA GGCCCGCTAC
GCCTTCAGCG AGCAAGAGGT CAAGCAATAC TTCACCGCCC CCAAGGTGCT GGCCGGACTG
TTCAAAATCG TCGAGACATT GTTTGAAGTC TCGATTCGCC CGGACTCGGC CGCCGTATGG
GATGCCGCAG TCGAGTTTTA CCGCATCGAA CGCGGCGGCC GACTGATCGG CCAGTTCTAC
CTCGACCAAC CCGCCCGCAC CGGCAAACGG GGCGGCGCCT GGATGGACAA TGTGCGCAAC
CGCTGGCTAC GCCCCGACAC CGGCGCGCTG CAAACGCCGG TGGCGCACCT GGTCTGCAAC
TTCGCGCCCG GCGTGGATGG CCGGCCGCCA CTGCTGACCC ACTACGATGT CATCACCCTG
TTCCACGAAT TTGGCCATGG GCTGCAGCAC CTGCTGACCC AGGTCGATGA ATTCTATGTC
TCGGGCATCA GCGGCGTGGA ATGGGACGCC GTGGAACTAC CCAGCCAGTT CATGGAAAAC
TTCTGCTGGG AATGGGATGT GCTCGGGCAC ATGACAGCCC ATGTGGACAC AGGCGCGCCG
CTGCCCCGGG AGCTGTTCGA CAAAATGCTC GCGGCGCGCA ACTTCCAAAG CGGCATGCAG
ACCCTGCGCC AAATCGAATT CGCGTTGTTC GACATGCTGC TGCACACCGA GCATGACCCG
GCCGAGGACT TCATGCCGCT GCTGGAGCGG GTGCGCAACG AAGTGTCGGT GCTGGCCTCG
CCCCCATTCA ACCGCCCCGC GCACAGCTTT AGCCACATCT TTGCCGGCGG CTATGCGGCG
GGCTACTACA GCTACCACTG GGCCGAAGTG CTCAGCGCCG ATGCCTATGC CGCCTTTGAA
GAAACCGCAG GCCAAAATGG CCAGCCCAGC ATCGAAACCG GCCGACGCTA TCGCCAAACC
ATTCTGGAAA CGGGCGGCAG CCGCCCGGCA ATGGAATCGT TCAAAGCCTT CCGCGGCCGT
GCGCCCAGCC TGGATGCCCT GCTGCGCCAC CGCGGCATGA TCGAAGAATC GACCGCCTGA
 
Protein sequence
MDNPLLTPSD LPPFDRIGAQ DVAPAIDVLL ERASQALETV TAPGFPARWQ AIMEQLEVPN 
EQLLRAWGAI SHLNSVADTP PLRAAYNAAL PRITEFWTRL GTNERLYAKY KAIDPASLTP
VQSQAHRNAM RNFVLAGAEL TGAAKERFAC IQQRQAELDQ KFSENVLDAT EAFAYYASAE
ELDGLPADIR QAALSAAQAE GRAGYKLTLK WPCYLPVMQF ASRSELREKL YRAYVTRASD
QAENRQFDNS ASIAEILALR REEARLLGYP HFGALSIAPK MAQSPAEVIG FLRDLARRAR
PHAEKDLADL RAFAARHKGL DDPQAWDWYY LSEQLKQARY AFSEQEVKQY FTAPKVLAGL
FKIVETLFEV SIRPDSAAVW DAAVEFYRIE RGGRLIGQFY LDQPARTGKR GGAWMDNVRN
RWLRPDTGAL QTPVAHLVCN FAPGVDGRPP LLTHYDVITL FHEFGHGLQH LLTQVDEFYV
SGISGVEWDA VELPSQFMEN FCWEWDVLGH MTAHVDTGAP LPRELFDKML AARNFQSGMQ
TLRQIEFALF DMLLHTEHDP AEDFMPLLER VRNEVSVLAS PPFNRPAHSF SHIFAGGYAA
GYYSYHWAEV LSADAYAAFE ETAGQNGQPS IETGRRYRQT ILETGGSRPA MESFKAFRGR
APSLDALLRH RGMIEESTA