Gene Plim_3737 details

Gene Information

Locus tag: Plim_3737
Symbol:
ID: 9140455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4803367
End bp: 4804566
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003631748
Protein GI: 296123970
COG category:
COG ID:
TIGRFAM ID:
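
The start, end, and length values listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4803367, 4804566)
print(length, protein_length(length))  # 1200 399
```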


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGACCA GAGTGATATT TGCCAGTTTC ATTTCGGCCC TGGCCGCCAG TGTGCTCACA 
GCCTGGTGGC TGCAACCATC CACTTTCGAG GCCCACGGTC TGGCTGTGGC CCAGGACCGC
GGTGCCCGAT TCAAATCGAC CCCTTACAGC GAAGAAGATG AAGCCAATCA ACCGAAGGTC
AAGCCCATCT ATATGGCGGA TGGTCTGACG CCCGAAGAAG CAGTGAATAT TGCGGTCTAT
CAGGCCGCGA ACCGCAGTGT GGTGAACATT ACGACAAAAG CGGTGCAATC GGGCCGGTTC
TCGCTGCTCG AACTGCAATC CGAAGGAAGT GGTTCGGGTT CGATCATTGA TAAAGCTGGT
CATGTGCTGA CCAATAATCA CGTTGTCGAA GGGGCCACAC AGATCAGTGT AACGCTCTAT
TCGGGCGAAT CATTCGATGC CACCATCGTG GGAGCTGATC CTGTCAACGA TATTGCCATC
CTGAAGCTCG AAGCTCCAGA AGATCAGCTC TATCCCGTGG AATTCGGCGA TTCGCGAAAA
CTTCGAGCCG GGATGCGGGT CTTTGCACTG GGAAACCCGT TTGGTCTCGA ACGCACCTTA
ACCACAGGGA TTATATCGAA TCTCAATCGA TCGTTGCAGA TTCATGGCAA TCGCACGATC
CGCTCGATCA TTCAGATCGA TGCCGCCATC AATCCCGGAA ACAGCGGCGG GCCTCTGCTC
GATGCCCACG GCAAGCTGAT CGGTATTAAC ACCGCCATTG CCACGACATC GGGTCAAAGT
GCCGGTGTTG GGTTTGCGAT TCCTGTGAAT CTGGTCACGC GGGTGGTGCC GCAATTGCTC
GCTTATGGGA AAGTGGTGCG CCCTGAAGTG GGGATTACCA AAGTCTTTGA GACGGAGAAA
GGCCTGCTGA TTGCTCAAAT GAAGCCCGGC GGGCCAGCAG AACGTGCCGG GTTACGAGGC
CCGAAAGTCG TTCGCGCCCG GCGTGGGCCA TTCATCAGTG AATCTGTCGA TCGTGCGGCT
GCCGACCGAA TTATGGCGGT TGATGGTCGC AAGATTTCGA CGGCTGATGA TTTTCTTGGT
TATGTCGAAG ATAAAAAGCC GGGCGATGTC GTTCGTTTGA CGATCGTTCG CGATGGTCAG
GAAATCGAGG TGCCACTGAC ACTCACGACG ACTGATCCCC TGGGTGAGCG AACTCCTTAG
 
Protein sequence
MMTRVIFASF ISALAASVLT AWWLQPSTFE AHGLAVAQDR GARFKSTPYS EEDEANQPKV 
KPIYMADGLT PEEAVNIAVY QAANRSVVNI TTKAVQSGRF SLLELQSEGS GSGSIIDKAG
HVLTNNHVVE GATQISVTLY SGESFDATIV GADPVNDIAI LKLEAPEDQL YPVEFGDSRK
LRAGMRVFAL GNPFGLERTL TTGIISNLNR SLQIHGNRTI RSIIQIDAAI NPGNSGGPLL
DAHGKLIGIN TAIATTSGQS AGVGFAIPVN LVTRVVPQLL AYGKVVRPEV GITKVFETEK
GLLIAQMKPG GPAERAGLRG PKVVRARRGP FISESVDRAA ADRIMAVDGR KISTADDFLG
YVEDKKPGDV VRLTIVRDGQ EIEVPLTLTT TDPLGERTP
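
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch (all helper names are hypothetical), the blocks can be joined, a GC fraction computed, and the nucleotides translated. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, so a standard lookup is used here; alternative start codons are not modelled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATGACCA GAGTGATATT TGCCAGTTTC ATTTCGGCCC TGGCCGCCAG TGTGCTCACA"
print(translate(first_line))  # MMTRVIFASFISALAASVLT
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same helpers would allow the reported protein length and GC content to be checked in the same way.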