Gene CPS_4143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_4143 
Symbol 
ID3518754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp4358942 
End bp4359991 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content38% 
IMG OID637286586 
Producttrypsin family protein 
Protein accessionYP_270797 
Protein GI71278654 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.483175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGAG CTCTTCTATT TCTTAGTATT TTCTTCTTAA GTGGCTGTGG AACACCAATC 
CAAGCATTTC AAGTCAAAGA AAATTCAATT CAAACACTAT ATAAGAAGGT CAACCCTTCA
GTGGTTGAAT TACATGTACA ATCACTAGCT GATCCAAAAA TTGGTCAAGT AGCATATAAA
GCAAAAACTG CAAATTCATT AGGGTCAGGA GCTTTAGTAA GTAGCGAAGG TCGTATTTTA
ACAGCAGCTC ATGTTGTTGA TAAAGCAACA GCAATTGAAG TTGAATTTGC TGATGGCACT
AAAACTACCG GTCATGTTGT TTGGGTAGAG CCGCTTATTG ATTTGGCGAT GATTCAGGCT
GGGGAAGTTC CTAGCACAGC TAAACCATTA AAATTAGCTA AAAGTAACGA TTATCAAATT
GGTGAACAAG TTATTATTAT TGGTGCACCT TTTGGTGTTA GCCACAGTTT ATCTGTTGGC
TATCTAAGTG GTATTCGTGA CGGTAACGCA ATACCGGGCA GAACCTTAGT GCCACGTTTA
TTACAAACTG ATGCTTCAAT TAACCAAGGT AATTCTGGCG GTCCAATGTT TAACCTCAAT
GGTGAAATTG TTGGTATTGT TAGCCATATA TTATCTAAAA GTGGTGGCAG CAATGGCTTA
GGTTTTGTTG TTTCAGTTGA TACCGTACGC CATATAATTG ATAGTGACCC AGGTACATTC
TCGGGTTTTA TTCCATTGTT ACTTAATAAA AAACAGTCGT ATGCTATTAA CAATACAGCA
GGCCACGGCA TGTTAATTCA GCATGTTATA CCAGGTACTT TAGCAGATAA ATTAGGCTTT
AAAGGTGGTA ACCTTAGTGT TGTCATTGGT CGCAGTCCTA TTTTACTTGG TGGTGACATA
CTGCTGGAAG TGGGCGGTCG TGCTATTATT GATTTGGCAT CAGCGGTTCA AATTAAGAAG
CATCTAGCTA CCTTTGAAAA AGGCGATAGA GTTACTTTTA AATATTTACG CAATGGTCAA
AAGAAAGAAA CTTATTGGAT AGTTGAGTAG
 
Protein sequence
MSRALLFLSI FFLSGCGTPI QAFQVKENSI QTLYKKVNPS VVELHVQSLA DPKIGQVAYK 
AKTANSLGSG ALVSSEGRIL TAAHVVDKAT AIEVEFADGT KTTGHVVWVE PLIDLAMIQA
GEVPSTAKPL KLAKSNDYQI GEQVIIIGAP FGVSHSLSVG YLSGIRDGNA IPGRTLVPRL
LQTDASINQG NSGGPMFNLN GEIVGIVSHI LSKSGGSNGL GFVVSVDTVR HIIDSDPGTF
SGFIPLLLNK KQSYAINNTA GHGMLIQHVI PGTLADKLGF KGGNLSVVIG RSPILLGGDI
LLEVGGRAII DLASAVQIKK HLATFEKGDR VTFKYLRNGQ KKETYWIVE