Gene Plim_3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3471 
Symbol 
ID9140189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4489651 
End bp4491783 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content54% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003631483 
Protein GI296123705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.723449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTGT TTCAAGTCAT GGCACGCACA ACCGGGGTTT TCGCAACTCG ACTGATCTTC 
GCGGGGGCTG TTGTGGCACT GCTGGGGATT CAGGCACAGG CTCAAAACTA TGGGCCCGAT
GAGACCAGAA GTCGCAACGA CTTTCCTGCT TATGAAGGGC GGAATAGCAC CTGGTCGAAT
GATCGATCTG AGCGGACACG CCCGCAGCTG GATCGCCCTG TTGATCCACG CGATGAGATG
TACTGGCCGA ACCCGGTGAG CAATTCGCTT TCTCGCCAGC AGATCGAAGA CCGTTTGTTC
GACGATCGCA GGAACTATGG AGGATTGAAT GACCCTGTCC GATCAAATGA CTGGCAACAG
CCGGCCCGTC GGCCATCGTC GGATCTGATC AGCGACTCCC GCGATCGTGA GCGCGATGCA
TTTCCCTATA GCACTCAGCC CTATGGTACC GGTCGCTATG ATACGGGTCC TAACGACAAT
GGCCGCTACG ATAGCAGTCG CTATGAAACT CGCTATCGCG AAGAGCTGTA CCGTAACGAT
GATTCTCGGA ACCTCCCATT TGATTCCCGT TGGAACAATG CCCGTGATCG CAGCCGAGAT
TTTGTGCCGA CAGCTCCCAG CAGCTGGGAT CGGAACAGCA ACTGGAACAA TAGCAGCAAC
ATGCCGCAGG ATCCCCGGCT GGAACCCGCT CCTGCCACTT TGCAGAAGAA TCACCCCACA
ATCCAGCAGT TGATTTCGAG GCGGTATCGT GATCAGAGAA TCTTGCAGAC TTTAAGCTCG
ATGTCGCCAC AGGCTGCTGA ATCGTTCTAT CTGGAAACGG CTCAACTGAT CGATGCCCGG
GCCTTAGCTC CTTCGGCCTA TCCGGTTCGA ACAGCAAAAG CGTTAGAAAA CCTGTATGTC
GCGGTGGATA ACCAGGAGTT CGTGAATGCC AACCGGTTGC AGGTTGCTCC TCAACAGCGG
GCTGCCTTTC AGCAGGCGAT TTCACAGATC GGTGGACAAG CTCAACCACG CAATGCACAG
GAAGCGATTC AGGTCATGCG GCAGGTGGCA CAAATGAGTC AGCAAATCGT CGGTTTGCGT
CCTCAAGTTG TCGCGATGGA GTTTACCTAT GGTGCACTGG AAACACTCGA TGAATATTCG
ACCTTCATGC CCAATGAAGT CAGTGGCGGG CCCAGTACAC AATTGGGTGA AAGCCTGGTG
GGGATTGGCG TAGAAATCGA AGCTCATCCT TTAGGACTCA AAGTGCTCAA GGCGATTACT
GGCGGGCCGG CAGCACAGGC CACCATCAAG CGAGGCGACA TTATTACGAT GATTGGCGGG
CGGTCGATTG CCGGTATGGA ACTCGATGAA GCGGCCAATC TGATCAAGGG CCCACTCGGA
TCGATGGTGC AGCTTCAGGT GAAACGAGGT GACTACATTG CCGATATGTC GTTGATGCGG
AGCCGAGTGC AGATTCAAAG CGTGGCTGAA GTTCGTATGG AAGATCAGGT CAACAAGGTG
GGGTACATCA AGCTCGACAA GTTTGCCGAA ACAACCAGCC GGGAATTGGA TCAGGCTTTG
ATGAATCTGC ACCAGCAGGG GATGCAATCG CTGATTCTCG ACTTGCGGGG AAACCCCGGC
GGCTTATTGA CCACAGCGAT CGAAGTGACC AACCGGTTTC TGCCAGGTGG AACGATTGTC
AGTACGAAAG GGCGTAACCA GGCGGATAAC AGTCAGGAAG TGGCCAACTA CCCGAATACC
TGGAAAGTGC CACTGGTGGT GCTGATCGAC AACAACAGTG CCAGTGCCAG TGAGATCTTT
GCCGCCGCAA TTCAGGATCA TCAGCGTGGC GTGGTTGTCG GTCAAAGGTC TTATGGTAAA
GGTTCCGTTC AGACACAGTT TCCACTCAAG ACGGTGAATG GAGGATTAAA GCTGACGACG
GCCAAATTCT ACGCTCCTTC GGGCCGGGAA ATGGCAGGTC AGGGTGTGAT TCCAGATGTC
GCAGTTCCTC TGGCACAGAA TGCAATGGAT ACGGTGGATT ACGATATGCA GGCGGCTGTG
AAGCTGGCCA CAGACAGCAC CACCCGCAAC ATGGCTGAGA CGATTGCCAG GCGTTATGCC
CCGCAGAATC AGTTCTTAGG GCAGGCGGGG TAG
 
Protein sequence
MKLFQVMART TGVFATRLIF AGAVVALLGI QAQAQNYGPD ETRSRNDFPA YEGRNSTWSN 
DRSERTRPQL DRPVDPRDEM YWPNPVSNSL SRQQIEDRLF DDRRNYGGLN DPVRSNDWQQ
PARRPSSDLI SDSRDRERDA FPYSTQPYGT GRYDTGPNDN GRYDSSRYET RYREELYRND
DSRNLPFDSR WNNARDRSRD FVPTAPSSWD RNSNWNNSSN MPQDPRLEPA PATLQKNHPT
IQQLISRRYR DQRILQTLSS MSPQAAESFY LETAQLIDAR ALAPSAYPVR TAKALENLYV
AVDNQEFVNA NRLQVAPQQR AAFQQAISQI GGQAQPRNAQ EAIQVMRQVA QMSQQIVGLR
PQVVAMEFTY GALETLDEYS TFMPNEVSGG PSTQLGESLV GIGVEIEAHP LGLKVLKAIT
GGPAAQATIK RGDIITMIGG RSIAGMELDE AANLIKGPLG SMVQLQVKRG DYIADMSLMR
SRVQIQSVAE VRMEDQVNKV GYIKLDKFAE TTSRELDQAL MNLHQQGMQS LILDLRGNPG
GLLTTAIEVT NRFLPGGTIV STKGRNQADN SQEVANYPNT WKVPLVVLID NNSASASEIF
AAAIQDHQRG VVVGQRSYGK GSVQTQFPLK TVNGGLKLTT AKFYAPSGRE MAGQGVIPDV
AVPLAQNAMD TVDYDMQAAV KLATDSTTRN MAETIARRYA PQNQFLGQAG