Gene Paes_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1222 
Symbol 
ID6459853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1330536 
End bp1331996 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content52% 
IMG OID642725212 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_002015897 
Protein GI194334037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.121021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000871052 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGATG ATATTCTTTC CTTAGAGCCC CAGGCGGTAT GGAAACATTT TTACGGCCTG 
ACCCGGATAG CCCGTCCTTC AGGCAACGAG GAGAAAGTCC GGAGATTTAT TGCTGGTTTC
GGGCAGTCTC TTGGATTGGA AACGATTGTT GACGAGTCGG GTAACGTTCT TATCCGTAAA
CCGGCGACAT CCGGCATGGC TGATCGGAAG GGCGTTATTC TTCAGGCTCA TGTTGATATG
GTTCCCCAGA AGAACAGTGG TACGGTTCAC GATTTTGACA AAGATCCTAT TGAACCCATT
GTCGATGGTC AATGGGTGCG GGCAAGGGGA ACGACGCTCG GAGCCGATAA CGGCATTGGC
GTTGCAGCGA TCATGGCTGT GCTGGAATCG ACGGAAATGC GTCATGGCCC TCTTGAAGCG
CTCTTTACCA GTAACGAAGA GAGCGGTATG ACCGGTGCAT TCGGACTGAA ATCCGGTGTG
CTCAGGGGCA GCATTCTTCT CAACCTGGAT TCGGAAAATG AGGGAGAGCT GTGTATCGGT
TGCGCTGGCG GCCTCGATGC AACGATGAGG TTCAGCTACG AAGAGCAGGC CGTCCCGGAT
GGACACACTG GGTTCACGAT CAGTGTGGGC GGTTTGAGAG GCGGTCACAG TGGCATGGAT
ATTGCACTTG GGCGCGGCAA TGCAAACAAG ATCCTGACCC GCCTGCTCTC TACCGGATAT
ACATGTCATG ATATGCTGTT GGCCTCGATC GACGGAGGTA GCCTTCGCAA TGCAATTCCC
CGAGAATCGA ATGCAACGGT CGCTGTTCCC TCCTTGAATG CTGAGTGTTT CGTGGAGGGG
CTTGCTTGCC TGGCTGCTGA TATGCGGCGG GAGTTGACGA CCGCTGATCC GTCGCTCAGA
ATTGATGTGT TTCCTGCCAC GCTTCCGGAC AGGGTGGTTG AGAATGGTGT TGCCGGGAGG
CTGCTTAAAG CGCTTTATGC CTGTCCGAAC GGTGTGATGC GAATGAGCGA TGAGATGCCT
GGTCTGGTTG AGACATCCAA TAATCTTGCT GTCGTCAGAT CCAAAAATGG ATTGATTTGC
GTTGAATGTC TGCTTCGAAG TTCCGTGGAT TCAGCTCGTG ATGATCTGGA ATCGATGATA
AGAAGCTGTT TTGAACTCGC AGGAGCCAGT ACGCTCTTCG ATGGAGGATA TCCTGGATGG
AAACCGAATC CTGAGTCGGC TATGCTGCAA CGCATGCGGG AAATCTATTG GAAGATGTTT
GGTGAAAATC CTGCTATTCA GGCTGTTCAT GCAGGTCTGG AATGCGGAAT TATAGGGGCA
ACGTACCCTG AACTCGATAT GATTTCTTTA GGACCGACCA TTCGATACCC CCACTCTCCG
GATGAAAAGG TCAATATCGC TTCAGTAGGC GCGTTTATGG ATTTTCTGGT CGAGACGCTC
TCCGGAGTTC CCTGCCAATA A
 
Protein sequence
MNDDILSLEP QAVWKHFYGL TRIARPSGNE EKVRRFIAGF GQSLGLETIV DESGNVLIRK 
PATSGMADRK GVILQAHVDM VPQKNSGTVH DFDKDPIEPI VDGQWVRARG TTLGADNGIG
VAAIMAVLES TEMRHGPLEA LFTSNEESGM TGAFGLKSGV LRGSILLNLD SENEGELCIG
CAGGLDATMR FSYEEQAVPD GHTGFTISVG GLRGGHSGMD IALGRGNANK ILTRLLSTGY
TCHDMLLASI DGGSLRNAIP RESNATVAVP SLNAECFVEG LACLAADMRR ELTTADPSLR
IDVFPATLPD RVVENGVAGR LLKALYACPN GVMRMSDEMP GLVETSNNLA VVRSKNGLIC
VECLLRSSVD SARDDLESMI RSCFELAGAS TLFDGGYPGW KPNPESAMLQ RMREIYWKMF
GENPAIQAVH AGLECGIIGA TYPELDMISL GPTIRYPHSP DEKVNIASVG AFMDFLVETL
SGVPCQ