Gene Ping_1037 details

Gene Information

Locus tag: Ping_1037
Symbol:
ID: 4623897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Psychromonas ingrahamii 37
Kingdom: Bacteria
Replicon accession: NC_008709
Strand:
Start bp: 1308678
End bp: 1310054
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 43%
IMG OID: 639796253
Product: protease Do
Protein accession: YP_942475
Protein GI: 119944795
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
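
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short sketch can confirm. The function name `cds_lengths` is hypothetical, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon; neither is stated on the record itself.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(cds_lengths(1308678, 1310054))  # (1377, 458)
```

Under these assumptions the result matches the listed 1377 bp and 458 aa.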


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAA AATCAGCTCG GTTATTAGTG ATGATGCTTT GCCTGAGTTT AAGTTACAGC 
GTAAATTCAG CAATACCTAA TGTTACGGCC CAGGGAAAAC CTTTTCCATC TCTTGCACCA
ATGTTGAAAA AAGTGAATTC CGCAGTGGTT AATATCGCCA CCTATTCAAC CAAACAGGCG
TTAGTCAATC CCTTATTAAA TGACCCTTTT TTCAGGCACT TCTTTGATAT CCCAGAGCAG
GGAAATTTAC AACAGAAAGC ACCTCAAAAG CGCCAGAAAA GTGCCGGTTC CGGGGTGATA
ATCAATGCTC AAGATGGCAT CGTCATGACC AATTACCATG TAATAAAAGA CGCTGATGAA
GTGCAGGTTT CATTGATCGA CGGACGCAGT TATAAAGCGA AAATAATAGG TTCAGACCCA
GAATTGGATA TTGCTATACT GTCAATAAAA GCTAAAAAAT TGACGCAGGT AACAATATCC
GAGTCGATTC TTTTAGAAGT TGGTGATTTT GTCGTTGCCA TTGGCAATCC CTTTGGTTTA
GGTCAAACAG TGACTACGGG TATTGTCAGT GCGCTCGGAC GCAGTGGTTT AGGGATAGAA
GGTTATGAAA ACTTTATCCA AACCGATGCC TCTATTAATC CGGGTAATTC CGGGGGAGCA
CTGGTTAATT TGAATGGTGA ATTGGTCGGT ATCAATACGG CTATTATTGC GCCGGCAGGC
GGGAATATAG GAATAGGATT TGCCATTCCA ATCAATATGG CTGAAGCAAG CATGCAGAAA
ATCATAAAAT ACGGTGAGGT TAAACGCGGT CAGATTGGTG TTTCTATTCA GGATATTACG
CGAGATTTAA GCGAGGCACT GGGCCTTGAA AATGGTCAAC TGGGTGTACT TGTTGCGGGT
GTAACAAAAG GCTCTCCCGC TGAAAAAGCG GGGTTAATCC CGGGCGATAT TATCACCGGT
GTTGATGGTC AACTAACCAA ATCTACCGGA CAGTTACGTA GCCAAATAGG CTTGAAAAGT
ATCGGTGATA CGGTCAAGGT TACATTATTG CATAATGGCA TTAAAAAAAC AGTGGATGTT
GGCATTGGCA AACCACAAAC CTTGAGTACT GCAGATTCAT CCGGAGAATT GCACCCTTTA
CTTGAGGGCG TTTCATTTGC CAATAATAAC AAGGGACAAG GTGTAAAGGT CACCGAAGTA
CAGGCAAATT CTCCGGCCGC CTACAGCGGG TTAGAGGAAG GTGATCTGAT TATCGCCACG
AATAAACATA GCGTTGATGA TTTATTATCC TTTAAAAAAG CGCTGGGACT TGCTAAGAAC
AGCATCTTAT TACAGGTCAA CCGCAATGGA ATGTCTTTAT TTATTGCCAT TCGTTAA
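
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs whitespace stripped before any computation. As a minimal sketch, the hypothetical helper `gc_percent` below does that and reports the percentage of G and C bases; the result for the full sequence can be compared against the GC content listed in the Gene Information section.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and newlines from the block layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100 * gc / len(seq)


# First two blocks of the gene sequence: 7 of 20 bases are G or C.
print(gc_percent("ATGAAGATAA AATCAGCTCG"))  # 35.0
```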
 
Protein sequence
MKIKSARLLV MMLCLSLSYS VNSAIPNVTA QGKPFPSLAP MLKKVNSAVV NIATYSTKQA 
LVNPLLNDPF FRHFFDIPEQ GNLQQKAPQK RQKSAGSGVI INAQDGIVMT NYHVIKDADE
VQVSLIDGRS YKAKIIGSDP ELDIAILSIK AKKLTQVTIS ESILLEVGDF VVAIGNPFGL
GQTVTTGIVS ALGRSGLGIE GYENFIQTDA SINPGNSGGA LVNLNGELVG INTAIIAPAG
GNIGIGFAIP INMAEASMQK IIKYGEVKRG QIGVSIQDIT RDLSEALGLE NGQLGVLVAG
VTKGSPAEKA GLIPGDIITG VDGQLTKSTG QLRSQIGLKS IGDTVKVTLL HNGIKKTVDV
GIGKPQTLST ADSSGELHPL LEGVSFANNN KGQGVKVTEV QANSPAAYSG LEEGDLIIAT
NKHSVDDLLS FKKALGLAKN SILLQVNRNG MSLFIAIR
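
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is illustrative and is not the database's own pipeline: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in permitted start codons).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first 10 residues.
print(translate("ATGAAGATAA AATCAGCTCG GTTATTAGTG"))  # MKIKSARLLV
```

The output agrees with the first block of the protein sequence shown above.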