Gene Paes_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1444 
Symbol 
ID6460226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp1575709 
End bp1577913 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content52% 
IMG OID642725431 
ProductProlyl oligopeptidase 
Protein accessionYP_002016111 
Protein GI194334251 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACT ACTCCTCTGC CATCAGGTCT TTCCTGTTAT GTCTCTTCAT AACCTCACAG 
CCATCCTTTG CCGGCGCAAA GGAGCTGCAG CAGCCGCCGA AGGCTGCTGA GATTGTGGTT
GAGGAAACCA TCTGCGATAC AACGATTGCC GATCCGTTCC GCTATATGGA AAACCTCGAG
AATCCCGTTG TCAAAGAATG GTTCCGCGGT CAGGCAACCT ACGCTCAATC CATGCTGCAG
CGCATTGCCG GACGGGAGGA ACTGATCGAG AAGATGCAGG AGTTTGATCA GCGGAAAAAA
GAGAAAGTCT ACAATCTGAC TATCAGCGAA AGCAACCGCT ACTTCTATCT GAAACGCTCA
CCATCCGACG AAACCGGAAA ACTTTTTTAC CGTAACGGGT TCAAAGGAAA AGAACATCTG
CTTTTTGATC CTGAAACCTA TGCGTCAAAG CCTGAGATAC GCTATTCTAT CGGAAAAATC
GCCCCATCAG ATGATGGGGA AACAATTGTC TTCTCGGTTG CTGCCAACGG TTCGGAAAAT
GCAACACTCC TCATCATGGA TGTGACAACC GCCGAGCTCT ATCCCGAAAA AATTGAACGC
TGCAGATTCG CTTCACCATC ATGGCTCCCC GATGGGAGCG GCTTTCTGTA CAACCGCATG
CGATCCGGCG AAGTACGAAG CATGGATGTT CAGATGGACA GCAAAATCTA TCTTCACACC
ATTGGCACCT CCCCGGTCAG CGACAGGGAG ATTTTTTCAC GAGCGACCAA CCCGGACCTG
CGCACGAGAA AAGAGGATAT CCCCGGAATG CATTACGACA AAAACAGCGG CTGCCTGTTC
GCCTTTGTTC ACAACGTCGA CCAGAGAATG ACCGTCTATT ATGCACCTGC AGATTCGCTC
CGCTCATCAT CCATCCCATG GAAACCGCTT TTCATGCCGG AAGACAATGT CTATGATTTT
GAAATCACCG ACAATGATCT TTACATCCTG ACGCCACAGG ACGCACCGAA CTTCAAGGTC
ATCAGAACAT CGCTTCATGA GCCCGACATC CGGCATGCGC AAACCGTGAT CGCTGAAACA
CCCGATGCGG TTCTGACCGG ATTTTCCCTG ACCAGCGAAG GCATCTACTA CTCACTTTCA
AGAAACGGGG TCCAGGCGGA AGTCTATCGA AAGGATTTCA ACGGAGAACA CCATGAACGC
CTCAAGCTGC CGTTTGCAGC AGGAACCGCA GCGGTCAGCA CAAGAGGGTT CCGCTTCAAG
GATGTCTGGG TGCTTGCAGC AGGCTGGGCC AACGACTACA AACGATACCG TTATGATGCC
GACAGGAAAC GCTTCATCAA GGAAACGCTC TCATCGACTG CCCGCTACCC GGAATACGAT
GATCTTGTCG TAGAGGAACT GATGATCCCC TCGCACGACG GTGTCGAGGT TCCGATCTCG
CTCATCTACA AAAAAGGCCT GAAAAAAGAT GGCACAAACC CCCTGCTCTT TTACGGGTAT
GGCGCCTATG GAAACGCCAT CACCCCATTC TTCAGCCCTG CGTTTCTGCT GTGGACATAC
CATGGAGGAA TTTTTGCAGT CGCTCATGTC CGCGGAGGGG GAGAACTTGG CGACCAATGG
CACAAAGACG GCATGAAAAC CACCAAAAGC AACACCTGGC TCGACCTGAT CAGCCTCGCT
GAATACAGCA TCAGGCAAGG CTATAGCTCG CCTGAGCACA TCGCTGTCAA CAGCGCAAGC
GCGGGAGGAA TCCTCGTCGG ACGGGCAATG ACCCAACGGC CGGAGCTCTT TGCCGCCGTC
ATACCGCAGG TGGGCGCGAT GAACCCGCTC CGGGGCGAAA ACACACCAAA CGGACCAGTC
AACGCACCTG AATTCGGCAC CGTCAACAAT CCACGCGAAT GCAGAGCACT GATCACTATG
GATCCGTATC TCAACATTCG AAAAGGGATT GACTACCCTG CAGCGCTCAT TACAGCAGGC
ATCAACGACC CGAGGGTTAT CGCATGGCAG CCGGCCAAAT TTGCCGCAAG ACTGCAGGCC
GCTACGACAT CCGGCAAACC GGTACTCTTT CTGACAGACT ACCAGGCCGG CCACGGCATG
GGCAATACAA AGACAAAACA GTTCGAAACA CTTGCCGATG TTCTCAGCTT CGCATTCTGG
CAGACAGGCC AGGAGGATTT TCAGCCATCA TCAACCACCA AATAA
 
Protein sequence
MKNYSSAIRS FLLCLFITSQ PSFAGAKELQ QPPKAAEIVV EETICDTTIA DPFRYMENLE 
NPVVKEWFRG QATYAQSMLQ RIAGREELIE KMQEFDQRKK EKVYNLTISE SNRYFYLKRS
PSDETGKLFY RNGFKGKEHL LFDPETYASK PEIRYSIGKI APSDDGETIV FSVAANGSEN
ATLLIMDVTT AELYPEKIER CRFASPSWLP DGSGFLYNRM RSGEVRSMDV QMDSKIYLHT
IGTSPVSDRE IFSRATNPDL RTRKEDIPGM HYDKNSGCLF AFVHNVDQRM TVYYAPADSL
RSSSIPWKPL FMPEDNVYDF EITDNDLYIL TPQDAPNFKV IRTSLHEPDI RHAQTVIAET
PDAVLTGFSL TSEGIYYSLS RNGVQAEVYR KDFNGEHHER LKLPFAAGTA AVSTRGFRFK
DVWVLAAGWA NDYKRYRYDA DRKRFIKETL SSTARYPEYD DLVVEELMIP SHDGVEVPIS
LIYKKGLKKD GTNPLLFYGY GAYGNAITPF FSPAFLLWTY HGGIFAVAHV RGGGELGDQW
HKDGMKTTKS NTWLDLISLA EYSIRQGYSS PEHIAVNSAS AGGILVGRAM TQRPELFAAV
IPQVGAMNPL RGENTPNGPV NAPEFGTVNN PRECRALITM DPYLNIRKGI DYPAALITAG
INDPRVIAWQ PAKFAARLQA ATTSGKPVLF LTDYQAGHGM GNTKTKQFET LADVLSFAFW
QTGQEDFQPS STTK