Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Paes_1222 |
Symbol | |
ID | 6459853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | - |
Start bp | 1330536 |
End bp | 1331996 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642725212 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_002015897 |
Protein GI | 194334037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.121021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000871052 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGATG ATATTCTTTC CTTAGAGCCC CAGGCGGTAT GGAAACATTT TTACGGCCTG ACCCGGATAG CCCGTCCTTC AGGCAACGAG GAGAAAGTCC GGAGATTTAT TGCTGGTTTC GGGCAGTCTC TTGGATTGGA AACGATTGTT GACGAGTCGG GTAACGTTCT TATCCGTAAA CCGGCGACAT CCGGCATGGC TGATCGGAAG GGCGTTATTC TTCAGGCTCA TGTTGATATG GTTCCCCAGA AGAACAGTGG TACGGTTCAC GATTTTGACA AAGATCCTAT TGAACCCATT GTCGATGGTC AATGGGTGCG GGCAAGGGGA ACGACGCTCG GAGCCGATAA CGGCATTGGC GTTGCAGCGA TCATGGCTGT GCTGGAATCG ACGGAAATGC GTCATGGCCC TCTTGAAGCG CTCTTTACCA GTAACGAAGA GAGCGGTATG ACCGGTGCAT TCGGACTGAA ATCCGGTGTG CTCAGGGGCA GCATTCTTCT CAACCTGGAT TCGGAAAATG AGGGAGAGCT GTGTATCGGT TGCGCTGGCG GCCTCGATGC AACGATGAGG TTCAGCTACG AAGAGCAGGC CGTCCCGGAT GGACACACTG GGTTCACGAT CAGTGTGGGC GGTTTGAGAG GCGGTCACAG TGGCATGGAT ATTGCACTTG GGCGCGGCAA TGCAAACAAG ATCCTGACCC GCCTGCTCTC TACCGGATAT ACATGTCATG ATATGCTGTT GGCCTCGATC GACGGAGGTA GCCTTCGCAA TGCAATTCCC CGAGAATCGA ATGCAACGGT CGCTGTTCCC TCCTTGAATG CTGAGTGTTT CGTGGAGGGG CTTGCTTGCC TGGCTGCTGA TATGCGGCGG GAGTTGACGA CCGCTGATCC GTCGCTCAGA ATTGATGTGT TTCCTGCCAC GCTTCCGGAC AGGGTGGTTG AGAATGGTGT TGCCGGGAGG CTGCTTAAAG CGCTTTATGC CTGTCCGAAC GGTGTGATGC GAATGAGCGA TGAGATGCCT GGTCTGGTTG AGACATCCAA TAATCTTGCT GTCGTCAGAT CCAAAAATGG ATTGATTTGC GTTGAATGTC TGCTTCGAAG TTCCGTGGAT TCAGCTCGTG ATGATCTGGA ATCGATGATA AGAAGCTGTT TTGAACTCGC AGGAGCCAGT ACGCTCTTCG ATGGAGGATA TCCTGGATGG AAACCGAATC CTGAGTCGGC TATGCTGCAA CGCATGCGGG AAATCTATTG GAAGATGTTT GGTGAAAATC CTGCTATTCA GGCTGTTCAT GCAGGTCTGG AATGCGGAAT TATAGGGGCA ACGTACCCTG AACTCGATAT GATTTCTTTA GGACCGACCA TTCGATACCC CCACTCTCCG GATGAAAAGG TCAATATCGC TTCAGTAGGC GCGTTTATGG ATTTTCTGGT CGAGACGCTC TCCGGAGTTC CCTGCCAATA A
|
Protein sequence | MNDDILSLEP QAVWKHFYGL TRIARPSGNE EKVRRFIAGF GQSLGLETIV DESGNVLIRK PATSGMADRK GVILQAHVDM VPQKNSGTVH DFDKDPIEPI VDGQWVRARG TTLGADNGIG VAAIMAVLES TEMRHGPLEA LFTSNEESGM TGAFGLKSGV LRGSILLNLD SENEGELCIG CAGGLDATMR FSYEEQAVPD GHTGFTISVG GLRGGHSGMD IALGRGNANK ILTRLLSTGY TCHDMLLASI DGGSLRNAIP RESNATVAVP SLNAECFVEG LACLAADMRR ELTTADPSLR IDVFPATLPD RVVENGVAGR LLKALYACPN GVMRMSDEMP GLVETSNNLA VVRSKNGLIC VECLLRSSVD SARDDLESMI RSCFELAGAS TLFDGGYPGW KPNPESAMLQ RMREIYWKMF GENPAIQAVH AGLECGIIGA TYPELDMISL GPTIRYPHSP DEKVNIASVG AFMDFLVETL SGVPCQ
|
| |